CLUSTERING OF MARKOV CHAIN EXCEEDANCES 



SIDNEY I. RESNICK AND DAVID ZEBER 

Abstract. The tail chain of a Markov chain can be used to model the dependence between extreme
observations. For a positive recurrent Markov chain, the tail chain aids in describing the limit of
a sequence of point processes $\{N_n, n \ge 1\}$, consisting of normalized observations plotted against
scaled time points. Under fairly general conditions on extremal behaviour, $\{N_n\}$ converges to a
cluster Poisson process. Our technique decomposes the sample path of the chain into iid regenerative
cycles rather than using the blocking argument typically employed in the context of stationarity with
mixing.



1. Introduction 

One of the effects of dependence in a time series is that extremes tend to cluster. This has applied
implications for risk contagion over time but is also mathematically interesting, and the challenge is
to relate the dependence structure precisely to the clustering. For a dependence as structured as
Markov dependence, how do we describe exceedance clusters?

Point processes powerfully describe extremal behaviour of certain time series. Under appropriate
conditions on the law of a process $\{X_j : j \ge 0\}$, the exceedance point process $N_n$ defined by

(1.1)  $N_n([0,s] \times (a,\infty]) = \#\{j \le sn : X_j > a b_n\}$

typically converges weakly to a Poisson limit as $n \to \infty$, where $b_n \to \infty$ is a threshold sequence. This
leads to a number of results on asymptotic distributions of large order statistics and exceedances of
an extreme level. Such results have been developed in a variety of contexts by [3, 6, 10-12, 14, 21].
More specific results exist for regularly varying processes [4, 7], regenerative processes [1, 27], and
Markov chains [22, 35]. Distributions of functionals of such point processes have been considered
in [28, 29, 36].
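As a concrete reading of (1.1), the following minimal sketch counts exceedances on a finite sample path. It assumes the path `xs` holds $X_0, \dots, X_{n-1}$ and takes $n$ to be its length; the function name `exceedance_count` and the sample values are illustrative and do not come from the paper.

```python
def exceedance_count(xs, b_n, s, a):
    """Return #{j <= s*n : X_j > a*b_n}, with n taken as len(xs) (an assumption)."""
    n = len(xs)
    return sum(1 for j, x in enumerate(xs) if j <= s * n and x > a * b_n)


# Illustrative path of length n = 8 with threshold level b_n = 2.0.
path = [0.3, 5.0, 4.2, 0.1, 0.7, 9.5, 0.2, 0.4]
print(exceedance_count(path, b_n=2.0, s=0.5, a=1.0))
```

Varying `s` and `a` traces out the counting measure $N_n$ on rectangles $[0,s] \times (a,\infty]$.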

For stationary processes, the dependence structure causes extremes to occur in clusters. The
clustering is often summarized using the extremal index $\theta$ introduced by Leadbetter et al. [14],
which is related to the asymptotic mean cluster size. To obtain a point process convergence result,
authors often employ the big block/little block technique and mixing conditions, such as Leadbetter
et al.'s condition $D(u_n)$ [14], to split the process into approximately independent and identically
distributed blocks. With an appropriate choice of block size, extremes within one such block belong
asymptotically to the same cluster. Under an assumption controlling the extremal behaviour within
each block, such as via the distribution of the number of exceedances, $N_n$ generally converges to
a limiting compound Poisson process, where the compounding at each time point approximates
the clustering within each block. For Markov chains, the tail chain is an asymptotic process that
models behaviour upon reaching an extreme state; see [22, 23, 26, 30-32]. Point process results for
stationary Markov chains employ the tail chain to specify the compounding in the limit process.
Under Markov dependence, the within-block behaviour is determined merely by conditions on the
marginal distribution and the transition kernel. Basrak and Segers [4] extended the tail chain model
to general multivariate regularly varying stationary processes.

Rootzén [27] focuses on regenerative processes, which split naturally into cycles. In this case,
the within-block condition is replaced by an assumption on the extremal behaviour over a cycle.
The main difference is that the cycles are of random but finite length, whereas the block size
increases deterministically with $n$. In particular, Rootzén shows that convergence of the sequence
of processes counting the number of exceedances depends on the asymptotics of the distribution of
the cycle maximum as well as the marginal distribution.

We combine these two approaches to derive the weak limit of $\{N_n\}$ when $\{X_n\}$ is a positive
recurrent Markov chain. Such chains display a regenerative structure and in the limit, $N_n$ is
approximated by a process consisting of clusters of points stacked above common time points,
each corresponding to a separate regenerative cycle. The heights of the points in each cluster are
determined by an independent run of the tail chain. We use distributional results for the tail chain
process in [26] and summarize the needed results in Section 2. We focus on the case of heavy-tailed marginals,
but believe our approach could be extended to accommodate more general marginal distributions.

1.1. Notation and Conventions. We review notation and relevant concepts. In general, bold
symbols represent vectors or sequences, and for $\mathbf{x} = (x_1, x_2, \dots)$ we write $\mathbf{x}_m := (x_1, \dots, x_m)$.

$f^{\leftarrow}$   the left-continuous inverse of a monotone function $f$, i.e., $f^{\leftarrow}(x) = \inf\{y : f(y) \ge x\}$.

$RV_\rho$   the class of regularly varying functions with index $\rho$.

$D[0,\infty)$   the space of real-valued càdlàg functions on $[0,\infty)$ endowed with the Skorohod topology; $D_{\mathrm{left}}[0,\infty)$ is the space of left-continuous functions.

$D^{\uparrow}[0,\infty)$   the subspace of $D_{\mathrm{left}}[0,\infty)$ consisting of non-decreasing functions $f$ with $f(0) = 0$ and $\lim_{x\to\infty} f(x) = \infty$.

$\mathcal{K}(\mathbb{E})$   the collection of compact subsets of $\mathbb{E}$.

$C(\mathbb{E})$   the space of real-valued continuous, bounded functions on $\mathbb{E}$.

$C_K^+(\mathbb{E})$   the space of non-negative continuous functions on $\mathbb{E}$ with compact support.

$M_+(\mathbb{E})$   the space of non-negative Radon measures on $\mathbb{E}$.

$M_p(\mathbb{E})$   the space of Radon point measures on $\mathbb{E}$.

LEB   Lebesgue measure on $\mathbb{R}$.

PRM($\mu$)   Poisson random measure on $\mathbb{E}$ with mean measure $\mu$.

$\epsilon_x(\cdot)$   point measure at $x$, i.e., $\epsilon_x(A) = \mathbf{1}_A(x)$.

$\nu_\alpha$   a measure on $(0,\infty]$ given by $\nu_\alpha(x,\infty] = x^{-\alpha}$ for $x > 0$, $\alpha > 0$.

$\Rightarrow$   weak convergence of probability measures.

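For a space $\mathbb{E}$ which is locally compact with countable base (for example, a subset of $[-\infty,\infty]^d$),
a sequence of measures $\{\mu_n\} \subset M_+(\mathbb{E})$ converges vaguely to $\mu \in M_+(\mathbb{E})$ (written $\mu_n \xrightarrow{v} \mu$) if
$\int_{\mathbb{E}} f\,d\mu_n \to \int_{\mathbb{E}} f\,d\mu$ as $n \to \infty$ for any $f \in C_K^+(\mathbb{E})$. The vague topology on $M_+(\mathbb{E})$ is metrizable
by the vague metric $d_v$, i.e., $d_v(\mu_n, \mu) \to 0$ iff $\mu_n \xrightarrow{v} \mu$. See [13, 20, 25] for further details. A
distribution $F$ on $[0,\infty)$ has a regularly varying tail with index $\alpha > 0$, denoted $1 - F \in RV_{-\alpha}$, if
there exists $b(t) \to \infty$ such that

$t F(b(t)\,\cdot) \xrightarrow{v} \nu_\alpha(\cdot)$ in $M_+(0,\infty]$ as $t \to \infty$,

where $\nu_\alpha(x,\infty] = x^{-\alpha}$ for $x > 0$. The function $b(\cdot)$ is called a scaling function.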
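For intuition about the scaling function, here is a small sketch under the assumption that $F$ is an exact Pareto($\alpha$) law on $[1,\infty)$, a choice made only for illustration. In that case $b(t) = t^{1/\alpha}$ works and $t\,P[X > b(t)x]$ equals $x^{-\alpha}$ as soon as $b(t)x \ge 1$. The function names are hypothetical.

```python
def pareto_tail(x, alpha):
    """Survival function P[X > x] of a Pareto(alpha) law on [1, inf)."""
    return 1.0 if x < 1.0 else x ** (-alpha)


def scaling(t, alpha):
    """Scaling function b(t) = t**(1/alpha), chosen so that t * P[X > b(t)] = 1."""
    return t ** (1.0 / alpha)


def scaled_tail(t, x, alpha):
    """t * P[X > b(t) * x]; for this law it matches x**(-alpha) once b(t)*x >= 1."""
    return t * pareto_tail(scaling(t, alpha) * x, alpha)


print(scaled_tail(1e4, 2.0, 1.5), 2.0 ** (-1.5))
```

For a general regularly varying tail the agreement is only asymptotic in $t$; the exact equality here is a feature of the Pareto assumption.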

If $\mathbf{X} = (X_0, X_1, X_2, \dots)$ is a (homogeneous) Markov chain and $K$ is a Markov transition kernel,
we write $\mathbf{X} \sim K$ to mean that the dependence structure of $\mathbf{X}$ is specified by $K$, i.e.,

$P[X_{n+1} \in \cdot \mid X_n = x] = K(x, \cdot), \quad n = 0, 1, \dots.$

Also, $P_\mu[\mathbf{X} \in \cdot]$ specifies the initial distribution $P[X_0 \in \cdot] = \mu$ (abbreviate $P_x := P_{\epsilon_x}$), and
$E_\mu$ denotes expectation with respect to $P_\mu$.

2. Extremal Component and Tail Chain Approximation

Let $\mathbf{X} = (X_0, X_1, \dots)$ be a Markov chain on $[0,\infty)$ with transition kernel $K$. The tail chain is
a finite-dimensional approximation to the chain $\mathbf{X}$ used to study the limit of $\{N_n\}$ given by (1.1).
Building on theory developed in [23, 30, 31], [26] presents the tail chain approximation in terms of
a related process known as the extremal component of $\mathbf{X}$, an approach we follow here.

2.1. Tail Chain Approximation. Suppose the transition kernel $K$ is in the domain of attraction
of a distribution $G$ (denoted $K \in D(G)$), which means [26]

$K(t, t\,\cdot) \Rightarrow G(\cdot)$ on $[0,\infty]$ as $t \to \infty$.

Taking $\xi_1, \xi_2, \dots$ to be iid random variables with common distribution $G$, set $\xi(n) = \prod_{j=1}^{n} \xi_j$, $n \ge 1$,
with $\xi(0) = 1$, and write $\boldsymbol{\xi} = \{\xi(n), n \ge 0\}$. The tail chain associated with $G$ [22, 26, 31] is
$\mathbf{T} = (T_0, T_1, \dots)$ given by

(2.1)  $T_n = T_0\, \xi_1 \cdots \xi_n = T_0\, \xi(n), \quad n \ge 0.$

Thus $\mathbf{T}$ is a multiplicative random walk and $\{0\}$ is an absorbing barrier for $\mathbf{T}$, accessible if $G(\{0\}) > 0$.
How does the tail chain approximate the extremal portion of $\mathbf{X}$?
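To make (2.1) concrete, the sketch below builds a tail chain path from given multipliers and compares it with a scaled path of a toy chain. The toy recursion $X_{n+1} = A_{n+1} X_n + B_{n+1}$ with $A_n, B_n \ge 0$ is an assumption introduced here for illustration (it is not taken from the paper); for it, $K(t, t\,\cdot)$ is the law of $A + B/t$, which converges to the law of $A$, so $G$ is the law of $A$.

```python
from itertools import accumulate
import operator


def tail_chain(t0, xis):
    """Return (T_0, ..., T_m) with T_n = T_0 * xi_1 * ... * xi_n, as in (2.1)."""
    return list(accumulate(xis, operator.mul, initial=t0))


def toy_chain(x0, a_coeffs, b_noise):
    """Hypothetical chain X_{n+1} = A_{n+1} * X_n + B_{n+1} (illustration only)."""
    path = [x0]
    for a, b in zip(a_coeffs, b_noise):
        path.append(a * path[-1] + b)
    return path


# A zero multiplier absorbs the tail chain at 0 from that step onward.
print(tail_chain(2.0, [0.5, 3.0, 0.0, 4.0]))

# Started from a large level t, the scaled toy chain stays close to the tail chain.
t = 1e8
A, B = [0.5, 3.0, 0.2], [1.0, 0.3, 0.7]
print([x / t for x in toy_chain(t, A, B)], tail_chain(1.0, A))
```

The comparison uses fixed multipliers rather than random draws so that the two paths can be matched step by step.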

An extremal boundary for $\mathbf{X}$ is a function $y(t)$ satisfying $0 \le y(t) \to 0$, such that

(2.2)  $K(t u_t,\ t[0, y(t)]) \to G(\{0\})$ as $t \to \infty$,

for any non-negative function $u_t = u(t) \to u > 0$. Such a function always exists if $K \in D(G)$
[26, Section 3.2]. If $y(t)$ is an extremal boundary, any function $0 \le \tilde{y}(t) \to 0$ with $\tilde{y}(t) \ge y(t)$ for
$t \ge t_0$ is also an extremal boundary. If $G(\{0\}) = 0$, then $y(t) \equiv 0$ is a convenient choice. Given
an extremal boundary for $K$, the extremal component of $\mathbf{X}$ is the process $\mathbf{X}$ zeroed as soon as it
crosses below the scaled extremal boundary. The first downcrossing occurs at time

(2.3)  $\tau(t) = \inf\{n \ge 0 : X_n \le t\, y(t)\},$

and the extremal component is the process $\mathbf{X}^{(t)} = (X_0^{(t)}, X_1^{(t)}, \dots)$ defined by

$X_n^{(t)} = X_n \cdot \mathbf{1}_{\{n < \tau(t)\}}, \quad n = 0, 1, \dots.$

Starting from an extreme level $X_0 = t$, the extremal boundary separates extreme states from
non-extreme states for the scaled process $t^{-1}\mathbf{X}$.

The tail chain approximation is the following [26, Theorem 3.3]. For $K \in D(G)$, $u > 0$, $m \ge 1$,

(2.4)  $P_{tu}\big[t^{-1}(X_1^{(t)}, \dots, X_m^{(t)}) \in \cdot\,\big] \Rightarrow P_u\big[(T_1, \dots, T_m) \in \cdot\,\big]$ on $[0,\infty]^m$ as $t \to \infty$.

So, the tail chain maps extreme states onto $(0,\infty)$ and contracts non-extreme states to the point
$\{0\}$. Note $\tau(t) = \inf\{n \ge 0 : X_n^{(t)} = 0\}$.
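The construction in (2.3) can be sketched on a finite path as follows. The function names are illustrative, the boundary value `y_t` stands for $y(t)$ at the given $t$, and returning `None` when the level is never reached is an assumption standing in for the value $\infty$.

```python
def downcrossing_time(path, t, y_t):
    """tau(t) = inf{n >= 0 : X_n <= t * y(t)}; None if the level is never reached."""
    level = t * y_t
    for n, x in enumerate(path):
        if x <= level:
            return n
    return None


def extremal_component(path, t, y_t):
    """X_n^{(t)} = X_n * 1{n < tau(t)}: the path zeroed from the first downcrossing on."""
    tau = downcrossing_time(path, t, y_t)
    if tau is None:
        return list(path)
    return [x if n < tau else 0.0 for n, x in enumerate(path)]


# With t = 10 and y(t) = 0.1 the level is 1.0, first crossed at index 2.
print(extremal_component([10.0, 6.0, 0.5, 7.0, 0.2], t=10.0, y_t=0.1))
```

Note that the later return to the level 7.0 is discarded: once the path has crossed below the boundary, the extremal component stays at zero.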

If the finite-dimensional extremal behaviour of $\mathbf{X}$ is completely accounted for by the extremal
component, then the tail chain approximation (2.4) extends from $\mathbf{X}^{(t)}$ to $\mathbf{X}$. When is this the
case? Say that $K$ satisfies the regularity condition if for any non-negative function $u_t = u(t) \to 0$,

(2.5)  $K(t u_t,\ t\,\cdot) \Rightarrow \epsilon_0(\cdot)$ on $[0,\infty]$ as $t \to \infty$.

Equivalent forms of (2.5) exist in [26, Section 4], and a relatively easy-to-check sufficient condition
is given in terms of update functions. If either (a) $y(t) \equiv 0$ is an extremal boundary; or (b) $K$
satisfies the regularity condition (2.5), then for $u > 0$, we strengthen (2.4) to [26, Theorem 4.1],

(2.6)  $P_{tu}\big[t^{-1}(X_1, \dots, X_m) \in \cdot\,\big] \Rightarrow P_u\big[(T_1, \dots, T_m) \in \cdot\,\big]$ on $[0,\infty]^m$ as $t \to \infty$.

2.2. Finite-Dimensional Convergence. The conditional approximation (2.4) requires that the
initial state become extreme. Combining (2.4) with a heavy-tailed initial distribution makes $\mathbf{X}^{(t)}$
have an unconditional distribution that is regularly varying (in a sense to be discussed) with a
limit measure determined by the tail chain. Depending on assumptions, convergences take place
on $\mathbb{E}_{\sqsupset} := (0,\infty] \times [0,\infty]^m$ or the larger space $\mathbb{E}^* := [0,\infty]^{m+1} \setminus \{\mathbf{0}\}$.

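Theorem 2.1 ([26, Proposition 5.1(b), Theorem 5.1]). Let $\mathbf{X} \sim K$ be a Markov chain on $[0,\infty)$ with
$K \in D(G)$, and suppose $X_0 \sim H$, with $1 - H \in RV_{-\alpha}$ with scaling function $b(\cdot)$. On $\mathbb{E}_{\sqsupset}$ define the
measure

(2.7)  $\mu(dx_0, d\mathbf{x}_m) = \nu_\alpha(dx_0)\, P_{x_0}\big[(T_1, \dots, T_m) \in d\mathbf{x}_m\big],$

and extend this to a measure $\mu^*$ on $\mathbb{E}^*$ by defining $\mu^*(\cdot \cap \mathbb{E}_{\sqsupset}) = \mu(\cdot)$ and $\mu^*(\mathbb{E}^* \setminus \mathbb{E}_{\sqsupset}) = 0$. For any
$m \ge 1$, the following convergences take place as $t \to \infty$.

(a) In $M_+((0,\infty]^m \times [0,\infty])$,

$t P\big[(X_0, \dots, X_m)/b(t) \in (\cdot) \cap (0,\infty]^m \times [0,\infty]\big] \xrightarrow{v} \mu\big((\cdot) \cap (0,\infty]^m \times [0,\infty]\big),$

and in $M_+(\mathbb{E}_{\sqsupset})$,

(2.8)  $t P\big[(X_0^{(b(t))}, \dots, X_m^{(b(t))})/b(t) \in \cdot\,\big] \xrightarrow{v} \mu(\cdot).$

If either (i) $G(\{0\}) = 0$; (ii) $y(t) \equiv 0$ is an extremal boundary; or (iii) $K$ satisfies the
regularity condition (2.5), then (2.8) can be strengthened to

(2.9)  $t P\big[(X_0, \dots, X_m)/b(t) \in \cdot\,\big] \xrightarrow{v} \mu(\cdot)$ in $M_+(\mathbb{E}_{\sqsupset})$.

(b) In the bigger space $\mathbb{E}^*$, we have

(2.10)  $t P\big[(X_0^{(b(t))}, \dots, X_m^{(b(t))})/b(t) \in \cdot\,\big] \xrightarrow{v} \mu^*(\cdot)$ in $M_+(\mathbb{E}^*)$

if and only if

(2.11)  $E\xi_1^\alpha < \infty$ and $t P\big[X_j^{(b(t))}/b(t) \in \cdot\,\big] \xrightarrow{v} (E\xi_1^\alpha)^j\, \nu_\alpha(\cdot)$ in $M_+(0,\infty]$, $j = 1, \dots, m$.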

Part (a) requires that the first observation is large and, with added conditions, part (b) removes
this requirement. Regardless of whether (2.11) holds, the limit is always a lower bound on the tail
weight of $X_j^{(b(t))}$, since

$\liminf_{t\to\infty} t P\big[X_j^{(b(t))}/b(t) > x\big] \ge \mu\big((0,\infty] \times [0,\infty]^{j-1} \times (x,\infty] \times [0,\infty]^{m-j}\big) = (E\xi_1^\alpha)^j\, x^{-\alpha}$

by (2.8) and Lemma 2.1 below. Markov's inequality, (2.8) and a moment condition:

(2.12)  $\exists\, \varepsilon > 0$ such that $\lim_{\delta \downarrow 0} \limsup_{t\to\infty} t\, E\big[(X_j^{(b(t))}/b(t))^{\varepsilon}\, \mathbf{1}_{\{X_0 \le \delta b(t)\}}\big] = 0, \quad j = 1, \dots, m,$

imply (2.11). See [16].

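Here is a formula that helps evaluate $\mu$ in (2.7) on certain sets.

Lemma 2.1. For a random variable $Y$, define the measure $\nu(dx, dy) = \nu_\alpha(dx)\, P[xY \in dy]$ on
$[0,\infty]^2 \setminus \{(0,0)\}$. We compute

(2.13)  $\nu([0,x] \times (y,\infty]) = y^{-\alpha}\, E\big[Y^\alpha\, \mathbf{1}_{\{Y > y x^{-1}\}}\big] - x^{-\alpha}\, P\big[Y > y x^{-1}\big].$

In particular, $\nu([0,\infty] \times (y,\infty]) = y^{-\alpha}\, E Y^\alpha$.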



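Proof. We obtain

$\nu([0,x] \times (y,\infty]) = \int_{[0,x]} \nu_\alpha(ds)\, P[Y > y s^{-1}] = \int_{[x^{-\alpha},\infty]} ds\, P[Y > y s^{1/\alpha}] = \int_{[x^{-\alpha},\infty]} ds\, P[y^{-\alpha} Y^\alpha > s]$

by change of variables. Applying Fubini's theorem, this becomes

$\int_{(x^{-\alpha},\infty]} (s - x^{-\alpha})\, P[y^{-\alpha} Y^\alpha \in ds] = y^{-\alpha}\, E\big[Y^\alpha\, \mathbf{1}_{\{Y > y x^{-1}\}}\big] - x^{-\alpha}\, P\big[Y > y x^{-1}\big].$

Letting $x \to \infty$, this quantity converges to $y^{-\alpha}\, E Y^\alpha$ by monotone convergence. □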
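Formula (2.13) can be checked numerically. The sketch below assumes a discrete $Y$ taking finitely many positive values (an illustrative choice, not a case singled out in the paper) and compares the closed form with a midpoint-rule evaluation of the integral $\int_{x^{-\alpha}}^{\infty} P[Y > y r^{1/\alpha}]\,dr$ obtained from the change of variables in the proof. Function names and the example distribution are hypothetical.

```python
def nu_formula(x, y, alpha, values, probs):
    """Right side of (2.13) for a discrete Y with P[Y = values[i]] = probs[i]."""
    big = [(v, p) for v, p in zip(values, probs) if v > y / x]
    moment = sum(p * v ** alpha for v, p in big)
    mass = sum(p for _, p in big)
    return y ** (-alpha) * moment - x ** (-alpha) * mass


def nu_numeric(x, y, alpha, values, probs, steps=100_000):
    """Midpoint rule for the integral of P[Y > y * r**(1/alpha)] over r > x**(-alpha)."""
    lo = x ** (-alpha)
    hi = (max(values) / y) ** alpha  # the integrand vanishes beyond this point
    if hi <= lo:
        return 0.0
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        r = lo + (i + 0.5) * h
        cut = y * r ** (1.0 / alpha)
        total += sum(p for v, p in zip(values, probs) if v > cut)
    return total * h


vals, probs = [0.25, 1.0, 3.0], [0.2, 0.5, 0.3]
print(nu_formula(2.0, 1.0, 2.0, vals, probs), nu_numeric(2.0, 1.0, 2.0, vals, probs))
```

Because the integrand is a step function, the midpoint rule is accurate only up to an error of the order of the step size, so the two numbers agree approximately rather than exactly.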

2.3. Maximum of the Extremal Component. We give conditions on the extremal component
which enable an informative point process limit result by controlling the positive portion of the
extremal component, the random vector of random length $\{X_j^{(t)} : j = 0, \dots, \tau(t) - 1\} = \{X_j : j =
0, \dots, \tau(t) - 1\}$. The conditions imply restrictions on the behaviour of the tail chain $\mathbf{T}$.

We study a positive recurrent chain $\mathbf{X}$ by splitting it into regenerative cycles and analyzing
its extremal properties via the extremal components of the cycles. For regenerative processes,
Asmussen [1] and Rootzén [27] point out the connection between point process convergence and
the asymptotic distribution of cycle maxima. Informed by this approach, we consider when the
distribution of the maximum over the extremal component has a regular variation property. The
limit measure of this regular variation can be used to compute an extremal index for $\mathbf{X}$ [27].

Here is a condition that controls the persistence of non-zero values of the extremal component:

(2.14)  $\lim_{m\to\infty} \limsup_{t\to\infty} P\big[\sup_{j \ge m} X_j^{(b(t))}/b(t) > a \,\big|\, X_0 > \delta b(t)\big] = 0$ for all $a, \delta > 0$.

Note $\sup_{j \ge m} X_j^{(b(t))} = \sup_{m \le j < \tau(b(t))} X_j\, \mathbf{1}_{\{\tau(b(t)) > m\}}$. Compare this condition with [4, Condition
4.1] and [22, Equation (3.1)], which are formulated in terms of block sizes. Condition (2.14) is
a tightness condition that complements the finite-dimensional convergences (2.8). Section 3 gives
simpler sufficient conditions depending on whether $G(\{0\}) > 0$ or $G(\{0\}) = 0$. Condition (2.14)
requires that the chain drift back to the non-extreme states after visiting an extreme state and
makes non-extreme states recurrent and the tail chain transient.

Proposition 2.1. Let $\mathbf{X} \sim K \in D(G)$ be a Markov chain on $[0,\infty)$ with initial distribution $H$ satisfying
$1 - H \in RV_{-\alpha}$ with scaling function $b(t)$, so that (2.8) holds. If $\mathbf{X}$ satisfies Condition (2.14), then

(2.15)  $\lim_{m\to\infty} P\big[\sup_{j \ge m} \xi(j) > a\big] = 0, \quad a > 0,$

and $\xi(n) \to 0$ as $n \to \infty$ in probability and almost surely, and therefore

(2.16)  $P[T_n \to 0] = 1.$

So the tail chain is transient and the additive random walk $\{\log T_n\}_{n \ge 0}$ satisfies $\log T_n \to -\infty$.
The tail chain $\mathbf{T}$ and $\mathbf{X}$ live on the same state space $[0,\infty)$, but for $\mathbf{T}$, $\{0\}$ is a special boundary
state which represents the collection of non-extreme states of $\mathbf{X}$ under the tail chain approximation.

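Proof. Observe from (2.8) that, for $r \ge m$, as $t \to \infty$,

$t P\big[X_0 > b(t),\ \sup_{m \le j \le r} X_j^{(b(t))} > b(t)\big] \to \int_{(1,\infty]} \nu_\alpha(dx)\, P_x\big[\sup_{m \le j \le r} T_j > 1\big] = \int_{(1,\infty]} \nu_\alpha(dx)\, P\big[\sup_{m \le j \le r} \xi(j) > x^{-1}\big].$

Therefore, by monotonicity and then monotone convergence,

$\limsup_{t\to\infty} t P\big[X_0 > b(t),\ \sup_{j \ge m} X_j^{(b(t))} > b(t)\big] \ge \lim_{r\to\infty} \int_{(1,\infty]} \nu_\alpha(dx)\, P\big[\sup_{m \le j \le r} \xi(j) > x^{-1}\big]$

$= \int_{(1,\infty]} \nu_\alpha(dx)\, P\big[\sup_{j \ge m} \xi(j) > x^{-1}\big] =: \int_{(1,\infty]} \nu_\alpha(dx)\, f_m(x).$

Condition (2.14) implies that $\int_{(1,\infty]} \nu_\alpha(dx)\, f_m(x) \to 0$ as $m \to \infty$. We claim that $f_m(x) \to 0$ for
any $x > 0$. Suppose instead that $\inf_m f_m(x_0) \ge c > 0$ for some $x_0$. Since the $f_m$ are all increasing
in $x$, we have $\inf_m f_m(x) \ge c$ for $x \ge x_0$. But this implies that

$\liminf_{m\to\infty} \int_{(1,\infty]} \nu_\alpha(dx)\, f_m(x) \ge \liminf_{m\to\infty} \int_{(1 \vee x_0,\infty]} \nu_\alpha(dx)\, f_m(x) \ge c\, \nu_\alpha(1 \vee x_0, \infty] > 0$

by Fatou's Lemma, contradicting Condition (2.14). Therefore, $P[\sup_{j \ge m} \xi(j) > x^{-1}] \to 0$ as $m \to \infty$
for all $x > 0$, establishing (2.15). □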

Condition (2.14) assumes the first observation exceeds $\delta b(t)$, which is in the spirit of (2.8). For
translating the stronger convergence of unconditional distributions (2.10) in the bigger space $\mathbb{E}^* :=
[0,\infty]^{m+1} \setminus \{\mathbf{0}\}$ to point process convergence, we will require an additional assumption:

(2.17)  $\exists\, m_0 \ge 1$ such that $\lim_{\delta \downarrow 0} \limsup_{t\to\infty} t P\big[X_0/b(t) \le \delta,\ \sup_{j \ge m_0} X_j^{(b(t))}/b(t) > a\big] = 0$ for all $a > 0$.

Analogously to (2.12), by Markov's inequality a moment condition is sufficient for Condition (2.17):

$\lim_{\delta \downarrow 0} \limsup_{t\to\infty} t\, E\big[(\sup_{j \ge m_0} X_j^{(b(t))}/b(t))^{\varepsilon}\, \mathbf{1}_{\{X_0 \le \delta b(t)\}}\big] = 0$ for some $\varepsilon > 0$.

Condition (2.17) implies a uniform bound on the $\alpha$th moment of the tail chain states.

Proposition 2.2. Let $\mathbf{X} \sim K \in D(G)$ be a Markov chain on $[0,\infty)$ with initial distribution $H$
satisfying $1 - H \in RV_{-\alpha}$, whose extremal component satisfies (2.10) on $M_+(\mathbb{E}^*)$. If $\mathbf{X}$ satisfies
Condition (2.17), then

(2.18)  $E\big(\sup_{j \ge 0} \xi(j)^\alpha\big) < \infty.$

Remark. Under (2.18), we necessarily have $E\xi_1^\alpha \le 1$ since

$\sup_{j \ge 0} (E\xi_1^\alpha)^j = \sup_{j \ge 0} E\xi(j)^\alpha \le E\big(\sup_{j \ge 0} \xi(j)^\alpha\big) < \infty.$

Recalling (2.11), this means that the marginal tails of the extremal component are no heavier than
the tail of $H$.



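Proof. Under (2.10), for $r \ge m_0$, as $t \to \infty$,

$t P\big[\sup_{m_0 \le j \le r} X_j^{(b(t))} > b(t)\big] \to \int_{(0,\infty]} \nu_\alpha(dx)\, P\big[\sup_{m_0 \le j \le r} \xi(j) > x^{-1}\big] = E\big(\sup_{m_0 \le j \le r} \xi(j)^\alpha\big)$

by Lemma 2.1. Therefore,

$\limsup_{t\to\infty} t P\big[\sup_{j \ge m_0} X_j^{(b(t))} > b(t)\big] \ge \lim_{r\to\infty} E\big(\sup_{m_0 \le j \le r} \xi(j)^\alpha\big) = E\big(\sup_{j \ge m_0} \xi(j)^\alpha\big).$

Furthermore, by Condition (2.17), for some $\delta > 0$,

$\limsup_{t\to\infty} t P\big[\sup_{j \ge m_0} X_j^{(b(t))} > b(t)\big] \le \limsup_{t\to\infty} t P\big[X_0 \le \delta b(t),\ \sup_{j \ge m_0} X_j^{(b(t))} > b(t)\big] + \limsup_{t\to\infty} t P[X_0 > \delta b(t)] < \infty,$

showing that $E(\sup_{j \ge m_0} \xi(j)^\alpha) < \infty$, which is enough for (2.18). □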



Under both Conditions (2.14) and (2.17), we derive the tail behaviour of the maximum of the 
extremal component of X. 

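Proposition 2.3. Let $\mathbf{X} \sim K \in D(G)$ be a Markov chain on $[0,\infty)$ with initial distribution $H$
satisfying $1 - H \in RV_{-\alpha}$, whose extremal component satisfies (2.10) on $M_+(\mathbb{E}^*)$. If $\mathbf{X}$ satisfies
Conditions (2.14) and (2.17), then

(2.19)  $t P\big[\sup_{0 \le j < \tau(b(t))} X_j/b(t) \in \cdot\,\big] \xrightarrow{v} c \cdot \nu_\alpha(\cdot)$ in $M_+(0,\infty]$,

where $c = P\big[\sup_{j \ge 1} \xi(j) \le 1\big] + E\big[\sup_{j \ge 1} \xi(j)^\alpha\, \mathbf{1}_{\{\sup_{j \ge 1} \xi(j) > 1\}}\big]$.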



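Proof. For $x > 0$, we have $[\sup_{0 \le j < \tau(b(t))} X_j/b(t) > x] = [\sup_{j \ge 0} X_j^{(b(t))}/b(t) > x]$. For $m \ge 1$, we
have on the one hand, by (2.10),

$\liminf_{t\to\infty} t P\big[\sup_{0 \le j < \tau(b(t))} X_j/b(t) > x\big] \ge \lim_{t\to\infty} t P\big[\sup_{0 \le j \le m} X_j^{(b(t))}/b(t) > x\big] = x^{-\alpha} + \int_{[0,x)} \nu_\alpha(du)\, P_u\big[\sup_{1 \le j \le m} T_j > x\big],$

from which, letting $m \to \infty$,

(2.20)  $\liminf_{t\to\infty} t P\big[\sup_{0 \le j < \tau(b(t))} X_j/b(t) > x\big] \ge x^{-\alpha} + \int_{[0,x)} \nu_\alpha(du)\, P_u\big[\sup_{j \ge 1} T_j > x\big].$

On the other hand, for $\delta > 0$ we have

$t P\big[\sup_{j \ge m} X_j^{(b(t))}/b(t) > x\big] \le t P\big[X_0/b(t) > \delta,\ \sup_{j \ge m} X_j^{(b(t))}/b(t) > x\big] + t P\big[X_0/b(t) \le \delta,\ \sup_{j \ge m} X_j^{(b(t))}/b(t) > x\big].$

Given $\varepsilon > 0$, by Condition (2.17), we may choose $\delta$ small enough that

$\limsup_{t\to\infty} t P\big[X_0 \le \delta b(t),\ \sup_{j \ge m_0} X_j^{(b(t))} > b(t) x\big] < \varepsilon/2,$

where $m_0$ is from Condition (2.17). Condition (2.14) permits the choice $m_1 \ge m_0$ so large that

$\limsup_{t\to\infty} t P\big[X_0 > \delta b(t),\ \sup_{j \ge m_1} X_j^{(b(t))} > b(t) x\big] < \varepsilon/2.$

Therefore, $\limsup_{t\to\infty} t P[\sup_{j \ge m} X_j^{(b(t))}/b(t) > x] < \varepsilon$ for $m \ge m_1$, and so

$\limsup_{t\to\infty} t P\big[\sup_{0 \le j < \tau(b(t))} X_j/b(t) > x\big] \le \lim_{m\to\infty} \limsup_{t\to\infty} t P\big[\sup_{0 \le j \le m} X_j^{(b(t))}/b(t) > x\big] + \varepsilon$

$= x^{-\alpha} + \int_{[0,x)} \nu_\alpha(du)\, P_u\big[\sup_{j \ge 1} T_j > x\big] + \varepsilon.$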



Combine this with (2.20), and apply formula (2.13) for $\nu([0,x] \times (x,\infty])$ to complete the proof. □

3. Point Process Convergence for Markov Chains

We now derive the limit of the exceedance point process $N_n$ defined in (1.1), where $\mathbf{X} =
(X_0, X_1, \dots)$ is a Markov chain on $[0,\infty)$ with transition kernel $K \in D(G)$. We write

(3.1)  $N_n = \sum_{j=0}^{\infty} \epsilon_{(j/n,\ X_j/b_n)},$

using the notation $\epsilon_x$ to denote the measure assigning unit mass at the point $x$, and $N_n$ is a random
element of $M_p([0,\infty) \times (0,\infty])$, the space of Radon point measures on $[0,\infty) \times (0,\infty]$, endowed
with the topology of vague convergence [13, 20, 25].

If $\mathbf{X}$ is positive recurrent, it is a regenerative process ([2, Section VII.3], [18]), so the sample path
of $\mathbf{X}$ splits into identically distributed cycles between visits to a certain set. The extremal properties
of $\mathbf{X}$ are determined by extremal behaviour of the individual cycles. This approach has been
developed for Markov chains by Rootzén [27], as well as for queues by Asmussen [1]. Our approach
introduces the tail chain approximation to describe the extremal behaviour of the regenerative
cycles using their extremal component.

3.1. Cycle Decomposition. Consider the case where $\mathbf{X}$ has a positive recurrent atom $A$. For
positive recurrent chains, atoms can be constructed by several methods if no natural atom exists.
See, e.g., [8, Chapter 6] or [17, Chapter 1.5]. An atom is a set such that for a probability distribution
$H$ on $[0,\infty)$,

(3.2)  $K(y, \cdot) = H(\cdot)$ for all $y \in A$ and $P_y[\tau_A < \infty] = 1$ for $y \ge 0$,

where $\tau_A = \inf\{n \ge 0 : X_n \in A\}$ is the first hitting time of $A$. Positive recurrence means that

(3.3)  $E_H \tau_A < \infty,$

where $E_H$ denotes expectation with respect to $H$ considered as the initial distribution of $X_0$.

Under (3.2), the sample path of $\mathbf{X}$ splits into iid cycles between visits to $A$, as follows. Define
the times $\{S_k\}$, $\{\tau_k^A\}$ recursively according to

(3.4)  $\tau_0^A = \tau_A, \quad S_0 = \tau_0^A + 1;$
       $\tau_k^A = \inf\{n \ge 0 : X_{S_{k-1}+n} \in A\}, \quad S_k = S_{k-1} + \tau_k^A + 1, \quad k \ge 1.$

Thus, the sequence $0 \le S_0 - 1 < S_1 - 1 < S_2 - 1 < \cdots$ gives the indices when $\mathbf{X}$ is in $A$, and
$X_{S_k} \sim H$ for $k \ge 0$. The values $\tau_k^A \ge 0$ are the number of steps $\mathbf{X}$ takes outside of $A$ between
visits to $A$. The cycles end by visits to $A$; cycles are the random elements

$C_0 = (X_0, X_1, \dots, X_{\tau_A})$ and $C_k = (X_{S_{k-1}}, \dots, X_{S_{k-1}+\tau_k^A}), \quad k \ge 1,$

of the space of finite sequences $\mathbb{S} = \bigcup_{m \ge 1} [0,\infty)^m$. The strong Markov property implies $C_0, C_1, \dots$ are
independent, and $C_1, C_2, \dots$ are identically distributed. In particular, for $k \ge 1$,

$P\big[(C_k;\ \tau_k^A) \in \cdot\,\big] = P\big[((X_{S_{k-1}}, \dots, X_{S_{k-1}+\tau_k^A});\ \tau_k^A) \in \cdot\,\big] = P_H\big[((X_0, \dots, X_{\tau_A});\ \tau_A) \in \cdot\,\big].$

Furthermore, $0 < S_0 < S_1 < S_2 < \cdots$ is a renewal process, with

(3.5)  $q = E(S_1 - S_0) = E_H \tau_A + 1 < \infty$

by (3.3). Applying the cycle decomposition, we may now write (3.1) as

(3.6)  $N_n = \sum_{j=0}^{\infty} \epsilon_{(j/n,\ X_j/b_n)} = \sum_{j=0}^{S_0 - 1} \epsilon_{(j/n,\ X_j/b_n)} + \sum_{k=0}^{\infty} \sum_{j=0}^{\tau_{k+1}^A} \epsilon_{((S_k+j)/n,\ X_{S_k+j}/b_n)} =: \chi'_n + \chi_n.$

As a family of random elements in $M_p([0,\infty) \times (0,\infty])$, $\{\chi'_n\}$ is asymptotically negligible.

Lemma 3.1. Assuming (3.3) and $b_n \to \infty$, $\chi'_n \Rightarrow 0$, the null measure, in $M_p([0,\infty) \times (0,\infty])$.

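Proof. Let $f \in C_K^+([0,\infty) \times (0,\infty])$ with support in $[0,R] \times [M,\infty]$. It is sufficient to verify that
$P[\chi'_n(f) > \gamma] \to 0$ for any $\gamma > 0$. For any integer $r \ge 0$ we have

$P[\chi'_n(f) > \gamma] = P\Big[\sum_{j=0}^{\tau_A} f\Big(\frac{j}{n}, \frac{X_j}{b_n}\Big) > \gamma\Big] \le \sum_{m=0}^{r} P\Big[\sum_{j=0}^{m} f\Big(\frac{j}{n}, \frac{X_j}{b_n}\Big) > \gamma,\ \tau_A = m\Big] + P[\tau_A > r],$

and, as $n \to \infty$,

$\sum_{m=0}^{r} P\Big[\sum_{j=0}^{m} f\Big(\frac{j}{n}, \frac{X_j}{b_n}\Big) > \gamma,\ \tau_A = m\Big] \le (r+1)\, P\Big[\sum_{j=0}^{r} f\Big(\frac{j}{n}, \frac{X_j}{b_n}\Big) > \gamma\Big] \le (r+1)\, P\Big[\sup_{0 \le j \le r} X_j > b_n M\Big] \to 0.$

Choosing $r$ to make $P[\tau_A > r]$ arbitrarily small, the result follows. □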
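The recursion (3.4) of Section 3.1 can be sketched on a finite path as follows. The atom is passed as a predicate `in_atom`, the function name is illustrative, and dropping an incomplete trailing cycle is a convenience assumed here for finite data; none of these choices comes from the paper.

```python
def cycle_decomposition(path, in_atom):
    """Split a finite path into cycles, each ending at a visit to the atom A.

    Returns (cycles, starts): cycles[k] is C_k and starts[k] is S_k, the index
    right after the (k+1)-th visit to A. A trailing incomplete cycle is dropped.
    """
    cycles, starts, begin = [], [], 0
    for n, x in enumerate(path):
        if in_atom(x):
            cycles.append(path[begin:n + 1])  # the cycle ends with the visit to A
            begin = n + 1                     # S_k = (index of the visit) + 1
            starts.append(begin)
    return cycles, starts


# Illustrative path with atom A = [0, 1]; visits to A occur at indices 1, 4 and 5.
path = [3.0, 0.5, 4.0, 2.0, 0.2, 0.9, 7.0]
cycles, starts = cycle_decomposition(path, lambda x: x <= 1.0)
print(cycles, starts)
```

In this example the cycle lengths minus one recover $\tau_0^A = 1$, $\tau_1^A = 2$ and $\tau_2^A = 0$, so consecutive visits to the atom produce cycles of length one.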



3.2. Point Process Convergence. Lemma 3.1 and Slutsky's theorem mean that the asymptotic
behaviour of $N_n$ and $\chi_n$ is the same. We obtain a weak limit for $\chi_n$ using the tail chain
approximation discussed in Section 2, provided that a cycle's extremal behaviour is adequately described
by its extremal component.

As usual, assume $K \in D(G)$, $1 - H \in RV_{-\alpha}$, and suppose $y(t)$ is an extremal boundary for $\mathbf{X}$.
We require a mild assumption that the atom $A$ be a bounded subset of the state space $[0,\infty)$,

(3.7)  $\sup A < \infty,$

as would usually be the case. Fix $k \ge 1$. The number of steps needed by the scaled process in
the $k$-th cycle to cross below the extremal boundary is

$\tau_k(t) = \inf\{n \ge 0 : X_{S_{k-1}+n} \le t\, y(t)\}.$

The extremal component of the $k$-th cycle is $C_k(t) := (X_{S_{k-1}+j} : j = 0, \dots, \tau_k(t) - 1)$.

Without loss of generality, we suppose the extremal component of a cycle is a subset of the
complete cycle. To see this, observe from the definition of $\tau(t)$ and $\tau_A$ that, without loss of generality,

(3.8)  $P[\tau(t) \le \tau_A,\ \forall t > 0] = 1.$

Indeed, (3.7) implies that $A \subset [0,c]$ for some $c$. Define $\tau_c = \inf\{n \ge 0 : X_n \le c\}$, so that $P[\tau_c \le \tau_A] = 1$;
we claim further that we may suppose $P[\tau(t) \le \tau_c,\ \forall t > 0] = 1$. If $y(t) \ge c/t$ for all $t > 0$, then this
follows directly. Otherwise, verify that $\tilde{y}(t) = y(t) \vee c/t$ is also an extremal boundary for $K$ (see the
remarks after (2.2)), and the corresponding downcrossing time satisfies $P[\tilde{\tau}(t) \le \tau_c,\ \forall t > 0] = 1$.
Therefore, for $k \ge 1$,

(3.9)  $P\big[((X_{S_{k-1}}, \dots, X_{S_{k-1}+\tau_k(t)-1});\ \tau_k(t), \tau_k^A) \in \cdot\,\big] = P_H\big[((X_0, \dots, X_{\tau(t)-1});\ \tau(t), \tau_A) \in \cdot\,\big]$

and $\{(C_k(t);\ \tau_k(t), \tau_k^A) : k \ge 1\}$ are independent, since each is a function of $(C_k;\ \tau_k^A)$. These
facts suggest we approximate $\chi_n$ by a point process whose observations consist of the extremal
components of iid copies of the chain $\mathbf{X}$ started from $X_0 \sim H$. This approximation is facilitated by
additional notation. Let $\{\mathbf{X}_k = (X_{k,j}, j \ge 0) : k \ge 0\}$ be iid copies of the Markov chain $\mathbf{X} \sim K$
with respect to $P_H(\cdot)$; that is, the initial distribution of each chain is fixed to be $H$. Define

$\tilde{\tau}_{k+1}(t) = \inf\{j \ge 0 : X_{k,j} \le t\, y(t)\}, \quad k = 0, 1, \dots,$

and for $k \ge 0$, form the extremal component $\mathbf{X}_k^{(t)} = (X_{k,j} \cdot \mathbf{1}_{\{j < \tilde{\tau}_{k+1}(t)\}},\ j \ge 0)$ of the $k$th chain.
Thus with respect to $P_H(\cdot)$, $(\mathbf{X}_k^{(t)}, \tilde{\tau}_{k+1}(t)) \stackrel{d}{=} (\mathbf{X}^{(t)}, \tau(t))$ for $k \ge 0$, with the tilde differentiating
the times $\tilde{\tau}_k(t)$ defined on the $k$th process $\mathbf{X}_k$ from the cycle times $\tau_k(t)$ defined on $\mathbf{X}$. Recall $\tau(t)$
is also defined on $\mathbf{X}$.

Next, generate an iid family of tail chains by letting $\{\boldsymbol{\xi}_k = (\xi_k(n), n \ge 0) : k \ge 0\}$ be
iid copies of the process $\boldsymbol{\xi} = (\xi(n), n \ge 0)$, recalling the notation around (2.1). Additionally, put
$\tau^*_{k+1} = \inf\{j \ge 0 : \xi_k(j) = 0\}$, the first time the $k$th tail chain hits 0. Use the convention $\inf \emptyset = \infty$;
for example, $\tau^*_{k+1} = \infty$ a.s., $k \ge 0$, if $G(\{0\}) = 0$. Finally, let

$\zeta = \sum_{k=0}^{\infty} \epsilon_{(t_k,\ i_k)}$

be a Poisson random measure on $[0,\infty) \times (0,\infty]$, independent of the $\{\boldsymbol{\xi}_k\}$, with mean measure
a product of Lebesgue measure on the time axis $[0,\infty)$ and Pareto measure $\nu_\alpha$ (given by $\nu_\alpha(x,\infty] =
x^{-\alpha}$) on the observation axis $(0,\infty]$. Recall $\alpha$ is the tail index of $H$.

The point process consisting of the observations $\mathbf{X}_k^{(b_n)}$, spaced in time according to the renewal
times $\{S_k\}$, converges to a cluster Poisson process which is basically $\zeta$ with a time scaling and
compounded in the second coordinate according to the iid tail chains $\{\boldsymbol{\xi}_k\}$. This result is basic to
analyzing the asymptotic behaviour of $N_n$ in (3.1).

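Proposition 3.1. Let $\mathbf{X}$ be a Markov chain on $[0,\infty)$ with transition kernel $K \in D(G)$ and initial
distribution $H$, such that $t H(b(t)\,\cdot) \xrightarrow{v} \nu_\alpha(\cdot)$ in $M_+(0,\infty]$, where $b(t) \to \infty$ and $b_n = b(n)$. The
renewal process $\{S_k\}$ is defined in (3.4), with mean interarrival time $q$ given by (3.5). With the
notation introduced in the previous paragraphs, we have the following with respect to $P_H$.

(a) If $\mathbf{X}$ satisfies Condition (2.14), then given $\delta > 0$, in $M_p([0,\infty) \times (0,\infty])$, as $n \to \infty$,

(3.10)  $\eta_n := \sum_{k=0}^{\infty} \sum_{j=0}^{\tilde{\tau}_{k+1}(b_n)-1} \epsilon_{((S_k+j)/n,\ X_{k,j}/b_n)}\, \mathbf{1}_{\{X_{k,0} > \delta b_n\}} \Rightarrow \sum_{k=0}^{\infty} \sum_{j=0}^{\tau^*_{k+1}-1} \epsilon_{(q t_k,\ i_k \xi_k(j))}\, \mathbf{1}_{\{i_k > \delta\}} =: \eta.$

(b) Suppose $\mathbf{X}$ satisfies (2.10) as well as both Conditions (2.14) and (2.17). Then in
$M_p([0,\infty) \times (0,\infty])$, as $n \to \infty$,

(3.11)  $\eta^*_n := \sum_{k=0}^{\infty} \sum_{j=0}^{\tilde{\tau}_{k+1}(b_n)-1} \epsilon_{((S_k+j)/n,\ X_{k,j}/b_n)} \Rightarrow \sum_{k=0}^{\infty} \sum_{j=0}^{\tau^*_{k+1}-1} \epsilon_{(q t_k,\ i_k \xi_k(j))} =: \eta^*.$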

Section 4 contains the proof. Paralleling the discussion in Section 2, we have two results
depending on the strength of the conditions. The weaker assumptions of part (a) yield a result
that selects cycles starting from an exceedance. Part (b) does not have to do such cycle selection.

The points of the limit process are arranged in stacks above common time points $q t_k$. The heights
of the points in each stack are specified by an independent run of the tail chain starting from $i_k$. If
$G(\{0\}) > 0$, then the $\tau^*_k$ are iid Geometric random variables with parameter $G(\{0\})$, so all stacks
have finite length. If $G(\{0\}) = 0$, then $P[\tau^*_k = \infty] = 1$ for each $k$. In this case, Condition (2.14) is
necessary to ensure that $\eta^*$ is Radon, by forcing the tail chain to drift towards 0 as in (2.16). The
process $\eta$ retains only those stacks of $\eta^*$ whose initial value exceeds the threshold $\delta$. Because there
are an infinite number of $i_k$ in any neighbourhood of 0, dispensing with the restriction in $\delta$ requires
that not too many of the $\xi_k(j)$ are large. This translates to the condition $E\xi_1^\alpha \le 1$, provided by
Condition (2.17).
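The stack structure of the limit can be illustrated by simulation. The sketch below draws the stacks of a cluster process of the form $\sum_k \sum_j \epsilon_{(q t_k,\, i_k \xi_k(j))} \mathbf{1}_{\{i_k > \delta\}}$ with base times in a finite window. It rests on several assumptions made for illustration: the multiplier law is supplied by the caller through `sample_xi`, stacks are truncated after `max_stack` points (needed whenever the tail chain never hits 0), and all names are hypothetical.

```python
import random


def simulate_eta(horizon, delta, alpha, q, sample_xi, rng, max_stack=50):
    """Draw the stacks with base time t_k <= horizon and base height i_k above delta.

    Points (t_k, i_k) come from a Poisson random measure with mean measure
    LEB x nu_alpha restricted to [0, horizon] x (delta, inf]. Each stack is
    [(q * t_k, i_k * xi_k(j)) for j = 0, 1, ...], stopped when the tail chain
    hits 0 or, as a practical truncation, after max_stack points.
    """
    rate = delta ** (-alpha)  # nu_alpha(delta, inf]
    stacks, t = [], rng.expovariate(rate)
    while t <= horizon:
        # Pareto height above delta: P[i_k > x] = (x / delta) ** (-alpha).
        height = delta * (1.0 - rng.random()) ** (-1.0 / alpha)
        stack, prod = [], 1.0  # xi_k(0) = 1
        while prod > 0.0 and len(stack) < max_stack:
            stack.append((q * t, height * prod))
            prod *= sample_xi(rng)
        stacks.append(stack)
        t += rng.expovariate(rate)
    return stacks


# Hypothetical multiplier law: 0 with probability 0.3, else uniform on (0.2, 0.9).
xi = lambda r: 0.0 if r.random() < 0.3 else r.uniform(0.2, 0.9)
for stack in simulate_eta(5.0, 0.5, 1.5, 2.0, xi, random.Random(0))[:3]:
    print(stack)
```

All points of a stack share the time coordinate $q t_k$, and with this multiplier law $G(\{0\}) = 0.3 > 0$, so stacks are finite without needing the truncation.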

To analyze $N_n$ in (3.1), we approximate $\chi_n$ in (3.6) by $\eta^*$ in (3.11), provided the extremal
component adequately describes extremal behaviour within each cycle. If the extremal boundary
is not identically zero, behaviour between the end of the extremal component and the end of the
cycle is not captured by the tail chain and we require that such observations do not significantly
influence extremal properties. To guarantee a result analogous to Part (a) above, we require

(3.12)  $\lim_{t\to\infty} P\big[\sup_{\tau(b(t)) \le j \le \tau_A} X_j/b(t) > a \,\big|\, X_0 > \delta b(t)\big] = 0$ for all $a, \delta > 0$,

and for a result analogous to Part (b) above, we require

(3.13)  $\lim_{t\to\infty} t P\big[\sup_{\tau(b(t)) \le j \le \tau_A} X_j/b(t) > a\big] = 0$ for all $a > 0$.

With these conditions, the point process $N_n$ converges to the limit $\eta^*$, and the distribution of the
cycle maximum behaves as if it has a regularly varying tail.

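Theorem 3.1. Let $\mathbf{X}$ be a Markov chain on $[0,\infty)$ with transition kernel $K \in D(G)$. Suppose
that $K$ has a positive recurrent bounded atom in the sense of (3.2), (3.3), and (3.7). Define the
renewal process $\{S_k\}$ with mean interarrival time $q$ as in (3.4) and (3.5) and assume further that
$t H(b(t)\,\cdot) \xrightarrow{v} \nu_\alpha(\cdot)$ in $M_+(0,\infty]$, where $b(t) \to \infty$. With respect to $P_H$, the following hold.

(a) If $\mathbf{X}$ satisfies Conditions (2.14) and (3.12), then given $\delta > 0$,

(3.14)  $\sum_{0 \le j < S_0} \epsilon_{(j/n,\ X_j/b_n)} + \sum_{k=1}^{\infty} \sum_{S_{k-1} \le j < S_k} \epsilon_{(j/n,\ X_j/b_n)}\, \mathbf{1}_{\{X_{S_{k-1}} > \delta b_n\}} \Rightarrow \eta$

in $M_p([0,\infty) \times (0,\infty])$, as $n \to \infty$, where $\eta$ is defined in (3.10).

(b) Suppose additionally that $\mathbf{X}$ satisfies (2.10) as well as Conditions (2.14), (2.17) and (3.13).
Recall $\eta^*$ from (3.11). Then, in $M_p([0,\infty) \times (0,\infty])$, as $n \to \infty$,

(3.15)  $N_n \Rightarrow \eta^*,$

and furthermore, the distribution of the cycle maximum has a regularly varying tail,

(3.16)  $t P_H\big[b(t)^{-1} \sup_{0 \le j \le \tau_A} X_j \in \cdot\,\big] \xrightarrow{v} c \cdot \nu_\alpha(\cdot)$ in $M_+(0,\infty]$,

where

(3.17)  $c = P\big[\sup_{j \ge 1} \xi(j) \le 1\big] + E\big[\sup_{j \ge 1} \xi(j)^\alpha\, \mathbf{1}_{\{\sup_{j \ge 1} \xi(j) > 1\}}\big].$

Proof. (a) First, note that the process on the left-hand side of (3.14) equals $\chi'_n + \chi_n$, where $\chi'_n$ is as in (3.6) and, in this part of the proof,

$\chi_n = \sum_{k=0}^{\infty} \sum_{j=0}^{\tau_{k+1}^A} \epsilon_{((S_k+j)/n,\ X_{S_k+j}/b_n)}\, \mathbf{1}_{\{X_{S_k} > \delta b_n\}}.$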



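Hence, by Lemma 3.1 it remains to show that $\chi_n \Rightarrow \eta$. Split $\chi_n$ according to the times $\{\tau_k(b_n)\}$:

$\chi_n = \sum_{k=0}^{\infty} \sum_{j=0}^{\tau_{k+1}(b_n)-1} \epsilon_{((S_k+j)/n,\ X_{S_k+j}/b_n)}\, \mathbf{1}_{\{X_{S_k} > \delta b_n\}} + \sum_{k=0}^{\infty} \sum_{j=\tau_{k+1}(b_n)}^{\tau_{k+1}^A} \epsilon_{((S_k+j)/n,\ X_{S_k+j}/b_n)}\, \mathbf{1}_{\{X_{S_k} > \delta b_n\}} =: \chi^*_n + \chi''_n.$

The equality holds on the set $\{\tau_k(b_n) \le \tau_k^A,\ n \ge 1,\ k \ge 1\}$, which has probability 1 by (3.8).
Because of (3.9) and the independence of the $(C_k(t), \tau_k(t))$, we have $\chi^*_n \stackrel{d}{=} \eta_n$ for each $n$, and
so $\chi^*_n \Rightarrow \eta$ by Proposition 3.1(a). By Slutsky's theorem, the result follows if $\chi''_n \Rightarrow 0$, so we show

$P[\chi''_n(f) > \gamma] = P\Big[\sum_{k=0}^{\infty} \sum_{j=\tau_{k+1}(b_n)}^{\tau_{k+1}^A} f\Big(\frac{S_k+j}{n}, \frac{X_{S_k+j}}{b_n}\Big)\, \mathbf{1}_{\{X_{S_k} > \delta b_n\}} > \gamma\Big] \to 0$

for any $f \in C_K^+([0,\infty) \times (0,\infty])$ and $\gamma > 0$. Let $f$ have support in $[0,R] \times [M,\infty]$. The previous
probability is bounded by

$P\Big[\sum_{k=0}^{2Rn-1} \sum_{j=\tau_{k+1}(b_n)}^{\tau_{k+1}^A} f\Big(\frac{S_k+j}{n}, \frac{X_{S_k+j}}{b_n}\Big)\, \mathbf{1}_{\{X_{S_k} > \delta b_n\}} > 0\Big] + P\Big[\sum_{k=2Rn}^{\infty} \sum_{j=\tau_{k+1}(b_n)}^{\tau_{k+1}^A} f\Big(\frac{S_k+j}{n}, \frac{X_{S_k+j}}{b_n}\Big)\, \mathbf{1}_{\{X_{S_k} > \delta b_n\}} > 0\Big].$

Observe that the second term is at most $P[S_{2Rn}/n \le R] = P[S_{2Rn}/(2Rn) \le 1/2] \to 0$ as $n \to \infty$,
since $S_m/m \to q$ a.s., and $q \ge 1$ by (3.5). The first term is bounded by

$P\Big[\bigcup_{k=0}^{2Rn-1} \Big\{\frac{X_{S_k}}{b_n} > \delta,\ \bigcup_{j=\tau_{k+1}(b_n)}^{\tau_{k+1}^A} \Big\{\frac{X_{S_k+j}}{b_n} > M\Big\}\Big\}\Big] \le 2Rn\, P_H\Big[\frac{X_0}{b_n} > \delta,\ \sup_{\tau(b_n) \le j \le \tau_A} \frac{X_j}{b_n} > M\Big],$

which vanishes as $n \to \infty$ by Condition (3.12).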



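(b) Recalling the decomposition (3.6), by Lemma 3.1 it is sufficient to show that $\chi_n \Rightarrow \eta^*$. This
follows by a similar argument as in part (a). Write

$\chi_n = \sum_{k=0}^{\infty} \sum_{j=0}^{\tau_{k+1}(b_n)-1} \epsilon_{((S_k+j)/n,\ X_{S_k+j}/b_n)} + \sum_{k=0}^{\infty} \sum_{j=\tau_{k+1}(b_n)}^{\tau_{k+1}^A} \epsilon_{((S_k+j)/n,\ X_{S_k+j}/b_n)} =: \chi^*_n + \chi''_n.$

Then $\chi^*_n \stackrel{d}{=} \eta^*_n \Rightarrow \eta^*$ by Proposition 3.1(b), and Condition (3.13) implies that $\chi''_n \Rightarrow 0$.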



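Next, we show (3.16). In light of (3.8), we have

$0 \le t P_H\Big[\sup_{0 \le j \le \tau_A} \frac{X_j}{b(t)} > x\Big] - t P_H\Big[\sup_{0 \le j < \tau(b(t))} \frac{X_j}{b(t)} > x\Big] \le t P_H\Big[\sup_{\tau(b(t)) \le j \le \tau_A} \frac{X_j}{b(t)} > x\Big] \to 0$

under Condition (3.13). Recalling that

$t P_H\Big[\sup_{0 \le j < \tau(b(t))} \frac{X_j}{b(t)} > x\Big] \to c\, x^{-\alpha}$

as $t \to \infty$ by Proposition 2.3, where $c$ is as in (3.17), completes the proof. □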
Setting Mn = Vo<j<nXj, Rootzen shows [27, Theorem 3.2] that (3.16) implies 

P[Mn < bnx] — > exp(-C(?"^x"°), x>0, 

where c is given by (3.17), and q is the mean interarrival time (3.5). Hence, in the stationary case, 
6 = c/q is the extremal index of the process X ([15, Section 2.2], [14]). On the other hand, for 
stationary regularly varying Markov chains with K S D{G) satisfying a condition analogous to 
Condition (2.14), it is known [4, Remark 4.7], 



supye(j) < 1 



supC(j) < 1 



sup^(j)" l{sup,>i?0)<i} = c - E ( sup^(i)°], 



where $Y \sim \mathrm{Pareto}(\alpha)$ is supported on $[1,\infty)$ and independent of $\xi$. Hence, for a stationary Markov
chain $X$ satisfying the assumptions of Theorem 3.1(b), the extremal index is given by

1 / N E ( sup >i^(i)^ 

E(supe(jr 
3.3. Discussion of conditions. We now consider simplifications of the above conditions.

3.3.1. Cases where $G(\{0\}) = 0$. If $G(\{0\}) = 0$, we can replace $X^{(b(t))}$ with $X$ in the finite-
dimensional convergence (2.8) (p. 4) when $H$ has a regularly varying tail, meaning that the tail
chain approximation completely describes the extremes of the chain $X$ in a finite-dimensional sense.
However, $G(\{0\}) = 0$ also implies that for any $m > 0$, as $t \to \infty$,

\[ P_t[m < \tau(t) < \tau_A] \to 1 \tag{3.18} \]

(see [26, Proposition 5.1 (d)]), meaning that, as the initial observation becomes more extreme, it
takes longer for $X$ to return to $A$ to complete the cycle. Hence, for Condition (2.14) to hold, we
need a condition that ensures $X$ eventually drifts away from extreme states:



\[ \lim_{m \to \infty} \limsup_{t \to \infty} P_t\Big[ \sup_{m < j < \tau_A} X_j > ta \Big] = 0 \quad \text{for all } a > 0. \tag{3.19} \]



Proposition 3.2. Suppose $X \sim K \in D(G)$ with $G(\{0\}) = 0$ and positive recurrent bounded atom
$A$, and $X_0 \sim H$ with $1 - H \in RV_{-\alpha}$. If $X$ satisfies Condition (3.19), both Conditions (2.14) and
(3.12) hold and consequently, the convergence (3.14) takes place.



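Proof. We first show that

\[ \lim_{m \to \infty} \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{m < j < \tau_A} \frac{X_j}{b(t)} > a \Big] = 0 \quad \text{for all } a, \delta > 0. \tag{3.20} \]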



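Indeed, for $c > \delta$, we have

\[ t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{m < j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le \int_{(\delta,c]} t P_H\Big[ \frac{X_0}{b(t)} \in du \Big] P_{b(t)u}\Big[ \sup_{m < j < \tau_A} \frac{X_j}{b(t)} > a \Big] + t P_H\Big[ \frac{X_0}{b(t)} > c \Big]. \]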



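Furthermore, for $\delta < u \le c$,

\[ P_{b(t)u}\Big[ \sup_{m < j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le P_{b(t)u}\Big[ \sup_{m < j < \tau_A} \frac{X_j}{b(t)u} > \frac{a}{c} \Big] \le \sup_{s \ge b(t)\delta} P_s\Big[ \sup_{m < j < \tau_A} \frac{X_j}{s} > \frac{a}{c} \Big]. \]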

Hence, by Condition (3.19),

\[ \lim_{m \to \infty} \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{m < j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le \nu_\alpha(\delta, c] \cdot 0 + \nu_\alpha(c, \infty] = c^{-\alpha}. \]

Letting $c \to \infty$ establishes (3.20). As (3.8) implies that $\sup_{m < j < \tau(b(t))} X_j \le \sup_{m < j < \tau_A} X_j$, Condi-
tion (2.14) follows. To verify Condition (3.12), argue that

Y TA-l Y Y Y '^A-l Y 

" \X.. > °1 ^ > ^''<'» ^ - - '] - [k5 > V > "] . 

of which the first term vanishes as $t \to \infty$ because of (3.18) (see [26, Proposition 5.1 (d)]). Appeal
to (3.20) and let $m \to \infty$ to complete the proof. □

Condition (3.19) is a condition on the transition kernel $K$; this is best discussed by recalling (see
[26, p. 5] for discussion) that a transition kernel $K \in D(G)$ has an update function $\psi$ of the form

\[ \psi(x, (Z, W)) = Zx + \phi(x, W), \tag{3.21} \]

where $Z \sim G$ and $t^{-1}\phi(t, w) \to 0$ for $w \in C$ with $P[W \in C] = 1$, and we can represent $K$ as

\[ K(x, B) = P[\psi(x, (Z, W)) \in B]. \]
Take $V_r = (Z_r, W_r)$, iid copies of $V = (Z, W)$, and write $\mathbf{V}_r = (V_1, \ldots, V_r)$. For $r \ge 1$ let $\psi^r(x, \mathbf{V}_r)$
denote the $r$-step update function, i.e., $K^r(x, B) = P[\psi^r(x, \mathbf{V}_r) \in B]$, and $\psi^0(x) = x$. By iteration,

\[ \psi^r(x, \mathbf{V}_r) = \Big( \prod_{j=1}^{r} Z_j \Big) x + \sum_{\ell=1}^{r-1} \Big( \prod_{j=\ell+1}^{r} Z_j \Big) \phi\big(\psi^{\ell-1}(x, \mathbf{V}_{\ell-1}), W_\ell\big) + \phi\big(\psi^{r-1}(x, \mathbf{V}_{r-1}), W_r\big). \]

Thus Condition (3.19) requires both Zm — ?> as in (2.16), and also an asymptotic stochastic 
boundedness condition on (j){-,W). Alternately, one could give criteria for Condition (3.19) using 
mean drift conditions for X or log X [17, p. 229]. 
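
The iterated form of the update function can be checked numerically. The following is a minimal sketch, not taken from the paper: the perturbation `phi`, the uniform laws for $Z$ and $W$, and all function names are hypothetical choices made only so that $t^{-1}\phi(t,w) \to 0$ holds. It compares the recursion $\psi^r = Z_r \psi^{r-1} + \phi(\psi^{r-1}, W_r)$ with the expanded $r$-step formula.

```python
import random

def phi(x, w):
    # Hypothetical perturbation term: bounded in x, so phi(t, w) / t -> 0 as t grows.
    return w / (1.0 + x)

def psi(x, v):
    # One-step update of the form (3.21): psi(x, (Z, W)) = Z * x + phi(x, W).
    z, w = v
    return z * x + phi(x, w)

def iterate(x, vs):
    # Returns [psi^0(x), psi^1(x, V_1), ..., psi^r(x, V_r)] computed by recursion.
    path = [x]
    for v in vs:
        path.append(psi(path[-1], v))
    return path

def closed_form(x, vs):
    # Expanded r-step formula:
    # (prod_j Z_j) x + sum_{l=1}^{r} (prod_{j>l} Z_j) phi(psi^{l-1}, W_l),
    # where the l = r term has an empty product equal to 1.
    path = iterate(x, vs)
    r = len(vs)
    total = x
    for z, _ in vs:
        total *= z
    for l in range(1, r + 1):
        coeff = 1.0
        for z, _ in vs[l:]:
            coeff *= z
        total += coeff * phi(path[l - 1], vs[l - 1][1])
    return total

rng = random.Random(0)
# Assumed innovation law: Z uniform on (0, 1.5), W uniform on (0, 1).
vs = [(rng.uniform(0.0, 1.5), rng.uniform(0.0, 1.0)) for _ in range(6)]
x0 = 50.0
recursive_value = iterate(x0, vs)[-1]
expanded_value = closed_form(x0, vs)
```

The two values agree up to floating-point rounding, which is just the induction step behind the displayed identity.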

3.3.2. Cases where $G(\{0\}) > 0$. In this case, (2.4) implies

\[ P_t[\tau(t) = m] \to P[\tau^* = m], \quad m \ge 1, \]

where $\tau^*$ is a Geometric random variable with parameter $G(\{0\})$. Hence, the tail chain terminates
after a finite number of steps. If either $y_0(t) \equiv 0$ is an extremal boundary, or $K$ satisfies the
regularity condition (2.5) (p. 3), Theorem 2.1 assures us that convergence (2.8) holds for $X$ with
respect to $P_H$, and Condition (2.14) follows directly since

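\[ \limsup_{t \to \infty} t P_H\Big[ \sup_{j > m} \frac{X_j^{(b(t))}}{b(t)} > a,\ X_0 > \delta b(t) \Big] \le \limsup_{t \to \infty} t P_H\big[ X_0 > \delta b(t),\ \tau(b(t)) > m \big] = \int_{(\delta,\infty]} \nu_\alpha(dx)\, P[x\xi(m) > 0] \to 0 \quad (m \to \infty). \]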
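
The geometric termination of the tail chain when $G(\{0\}) > 0$ is easy to see in simulation. The sketch below is illustrative only: the law used for $Z$ (mass `p_zero` at 0, otherwise uniform on $(0.5, 1.5)$), the function names, and the sample size are assumptions, not choices made in the paper. It samples the hitting time of 0 by the multiplicative walk $\xi(m) = Z_1 \cdots Z_m$ and compares it with a Geometric law with success probability $G(\{0\})$.

```python
import random

def sample_z(rng, p_zero):
    # Hypothetical G: mass p_zero at 0, otherwise uniform on (0.5, 1.5).
    if rng.random() < p_zero:
        return 0.0
    return rng.uniform(0.5, 1.5)

def hitting_time_of_zero(rng, p_zero):
    # xi(0) = 1 and xi(m) = Z_1 * ... * Z_m; return the first m >= 1 with xi(m) = 0.
    xi, m = 1.0, 0
    while xi > 0.0:
        m += 1
        xi *= sample_z(rng, p_zero)
    return m

rng = random.Random(12345)
p_zero = 0.25
samples = [hitting_time_of_zero(rng, p_zero) for _ in range(20000)]
mean_tau = sum(samples) / len(samples)        # should be close to 1 / p_zero
frac_one = samples.count(1) / len(samples)    # should be close to p_zero
```

Under these assumptions the empirical mean is near $1/G(\{0\})$ and the fraction of walks killed at the first step is near $G(\{0\})$.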

The regularity condition (2.5) extends to any finite number of steps; that is, iterates of $K$ also
satisfy the condition. However, unless $y_0(t) \equiv 0$ is an extremal boundary, we need the regularity
condition to hold uniformly over the whole cycle of random length $\tau_A$ to prevent $X$ from returning
to an extreme state within the same cycle, after crossing below the extremal boundary. Condition
(3.22) given next accomplishes this. (Note that even if $y_0(t) \equiv 0$ is an extremal boundary for $K$,
we are using an extremal boundary $y(t)$ chosen to satisfy (3.8).)

\[ \lim_{t \to \infty} P_{tu_t}\Big[ \sup_{1 \le j < \tau_A} X_j > ta \Big] = 0 \quad \text{whenever } u_t = u(t) \to 0,\ a > 0. \tag{3.22} \]

Recalling the update function form (3.21), the regularity condition (2.5) holds if the function
$\phi(\cdot, w)$ is bounded near 0 for each $w$ in a set of probability 1 [26, Proposition 4.1]. Condition
(3.22) is a stronger boundedness restriction on $\phi(\cdot, w)$ near 0. Alternatively, when $K$ satisfies the
regularity condition (2.5), Condition (3.22) may be viewed as a restriction on $\tau_A$, since then the condition

\[ \lim_{m \to \infty} \limsup_{t \to \infty} P_{tu_t}[\tau_A > m] = 0 \quad \text{whenever } u_t = u(t) \to 0 \tag{3.23} \]

is sufficient for (3.22). This follows from the decomposition

\[ P_{tu_t}\Big[ \sup_{1 \le j < \tau_A} X_j > ta \Big] \le P_{tu_t}[\tau_A > m] + P_{tu_t}\Big[ \sup_{1 \le j \le m} X_j > ta \Big], \]

with (3.23) controlling the first right-hand term and (2.5) controlling the second.

Proposition 3.3. Suppose $X \sim K \in D(G)$ with $G(\{0\}) > 0$ and $1 - H \in RV_{-\alpha}$, and $X$ has a
positive recurrent, bounded atom $A$. Then $X$ satisfies (2.14) with respect to $P_H$. Moreover, if either

(i) $y_0(t) \equiv 0$ is an extremal boundary for $K$,

or

(ii) Condition (3.22) holds,

then Condition (3.12) holds with respect to $P_H$ and thus convergence (3.14) (p. 11) takes place.
Proof. First, note that by [26, Proposition 5.1 (d)], as $t \to \infty$,

\[ t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{m < j < \tau(b(t))} \frac{X_j}{b(t)} > a \Big] \le t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau(b(t)) > m \Big] \to \delta^{-\alpha} (1 - G(\{0\}))^m. \tag{3.24} \]

Since $G(\{0\}) > 0$, the right side of (3.24) vanishes as $m \to \infty$, establishing Condition (2.14). Next,
to analyze Condition (3.12), consider the case where $y_0(t) \equiv 0$ is an extremal boundary, and write
$\tau_0 = \inf\{n \ge 0 : X_n = 0\}$. For any $m$,



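\[ t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{\tau(b(t)) \le j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le \sum_{r=1}^{m} t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau(b(t)) = r,\ \tau_0 > r \Big] + t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau(b(t)) > m \Big], \tag{3.25} \]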



which is obtained by splitting according to whether $\tau(b(t)) \le m$ or the complement and using the
fact that $\tau(b(t)) = r$ and $\sup_{\tau(b(t)) \le j < \tau_A} X_j/b(t) > a$ implies $\tau_0 > r$. For a typical term in the sum,

\[ t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau(b(t)) = r,\ \tau_0 > r \Big] \le t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau_0 > r \Big] - t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau(b(t)) > r \Big] \]
\[ \to \delta^{-\alpha} (1 - G(\{0\}))^r - \delta^{-\alpha} (1 - G(\{0\}))^r = 0, \]

the convergence following from [26, Proposition 5.1 (d)], since both $y(t)$ and $y_0(t)$ are extremal
boundaries. The rightmost term in (3.25) is handled as in (3.24).

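Finally, analyze Condition (3.12) when Condition (3.22) holds. For any $m$, we have

\[ t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{\tau(b(t)) \le j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le \sum_{r=1}^{m} t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{r \le j < \tau_A} \frac{X_j}{b(t)} > a,\ \tau(b(t)) = r \Big] + t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \tau(b(t)) > m \Big], \]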



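and

\[ t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{r \le j < \tau_A} \frac{X_j}{b(t)} > a,\ \tau(b(t)) = r \Big] \le \int_{(\delta,\infty] \times (y(b(t)),\infty]^{r-1} \times [0,\infty]} t P_H\Big[ \frac{(X_0, \ldots, X_r)}{b(t)} \in d(x_0, \ldots, x_r) \Big]\, h_t(x_r), \tag{3.26} \]

where

\[ h_t(x) = 1_{[0, y(b(t))]}(x)\, P_{b(t)x}\Big[ \sup_{1 \le j < \tau_A} \frac{X_j}{b(t)} > a \Big]. \]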



We claim that $h_t(u_t) \to 0$ whenever $u_t \to u \ge 0$. Indeed, if $u > 0$, then $h_t(u_t) = 0$ for large $t$ such
that $y(b(t)) < u$. Otherwise, $u_t \to 0$, and $h_t(u_t) \to 0$ by Condition (3.22). Therefore, the integral
converges to 0 by combining Lemmas 8.2 and 8.4 with Theorem 3.2 from [26]. Applying (3.24)
completes the proof. □

3.4. Weak convergence to a cluster process. If the finite-dimensional distributions of $X$ are
jointly regularly varying (in the sense of (2.10) with $X$ replacing $X^{(b(t))}$), we obtain a point process
limit for $X$ under a condition analogous to Condition (2.17): There exists $m_0 \ge 1$ such that

\[ \lim_{\delta \downarrow 0} \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} \le \delta,\ \sup_{m_0 < j < \tau_A} \frac{X_j}{b(t)} > a \Big] = 0 \quad \text{for all } a > 0. \tag{3.27} \]

Proposition 3.4. Suppose $X \sim K \in D(G)$ has a positive recurrent, bounded atom $A$, and $1 - H \in RV_{-\alpha}$.
Assume further that, with respect to $P_H$, $X$ is regularly varying in the sense of (2.10) (p. 4),
with $X$ replacing $X^{(b(t))}$, and satisfies Condition (3.12). Under Condition (3.27), both Conditions
(2.17) and (3.13) hold with respect to $P_H$.
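Proof. Recalling $\sup_{m_0 < j < \tau(b(t))} X_j \le \sup_{m_0 < j < \tau_A} X_j$ yields (2.17). Next, given $\delta > 0$, write

\[ t P_H\Big[ \sup_{\tau(b(t)) \le j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le t P_H\Big[ \frac{X_0}{b(t)} > \delta,\ \sup_{\tau(b(t)) \le j < \tau_A} \frac{X_j}{b(t)} > a \Big] + t P_H\Big[ \frac{X_0}{b(t)} \le \delta,\ \sup_{1 \le j < \tau_A} \frac{X_j}{b(t)} > a \Big]. \]

Condition (3.12) makes the first right side term go to 0 as $t \to \infty$, and for the second term we have

\[ \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} \le \delta,\ \sup_{1 \le j < \tau_A} \frac{X_j}{b(t)} > a \Big] \le \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} \le \delta,\ \sup_{1 \le j \le m_0} \frac{X_j}{b(t)} > a \Big] + \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} \le \delta,\ \sup_{m_0 < j < \tau_A} \frac{X_j}{b(t)} > a \Big] \]
\[ \le \mu^*\big( [0,\delta] \times [\mathbf{0}, \mathbf{a}]^c \big) + \limsup_{t \to \infty} t P_H\Big[ \frac{X_0}{b(t)} \le \delta,\ \sup_{m_0 < j < \tau_A} \frac{X_j}{b(t)} > a \Big], \]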



where a = (a, . . . ,a). Letting J, 0, the first term vanishes by (2.10), since fj,*(E.'^ \ E^) = [26, 
Theorem 5.1]. The second term is taken care of by Condition (3.27). □ 

We now rephrase Theorem 3.1 in terms of our new conditions. 

Theorem 3.2. Let $X$ be a Markov chain on $[0, \infty)$ with transition kernel $K \in D(G)$ such that $K$
has a positive recurrent bounded atom in the sense of (3.2), (3.3), and (3.7). The initial distribution
$H$ has a regularly varying tail and satisfies $tH(b(t)\cdot) \xrightarrow{v} \nu_\alpha(\cdot)$ in $M_+(0, \infty]$. Assume for any $m > 0$
that $(X_0, \ldots, X_m)$ is regularly varying in $M_+([0, \infty]^{m+1} \setminus \{\mathbf{0}\})$,

\[ t P_H\big[ (X_0, \ldots, X_m)/b(t) \in (dx_0, d\mathbf{x}_m) \big] \xrightarrow{v} \nu_\alpha(dx_0)\, P_{x_0}\big[ (T_1, \ldots, T_m) \in d\mathbf{x}_m \big], \tag{3.28} \]

and that Condition (3.27) holds with respect to $P_H$.

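(a) If $G(\{0\}) = 0$, and $K$ satisfies Condition (3.19), then

\[ \sum_{j=0}^{\infty} \epsilon_{(j/n,\ X_j/b_n)} \Rightarrow \sum_{k=0}^{\infty} \sum_{j=0}^{\infty} \epsilon_{(q t_k,\ i_k \xi_k(j))} \quad \text{in } M_p([0,\infty) \times (0,\infty]) \text{ as } n \to \infty. \]

(b) If $G(\{0\}) > 0$, and either $y_0(t) \equiv 0$ is an extremal boundary for $K$, or $K$ satisfies Condition
(3.22), then

\[ \sum_{j=0}^{\infty} \epsilon_{(j/n,\ X_j/b_n)} \Rightarrow \sum_{k=0}^{\infty} \sum_{j=0}^{\tau^*_{k+1} - 1} \epsilon_{(q t_k,\ i_k \xi_k(j))} \quad \text{in } M_p([0,\infty) \times (0,\infty]) \text{ as } n \to \infty, \]

where the $\{\tau_k^*\}$ are iid Geometric random variables with parameter $G(\{0\})$.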



4. Proof of Proposition 3.1 

Recall that $\{X, X_k, k \ge 0\}$ are iid copies of a Markov chain $X \sim K$ with heavy tailed initial
distribution $H$ satisfying $tH(b(t)\cdot) \xrightarrow{v} \nu_\alpha(\cdot)$ in $M_+(0, \infty]$, where $b(t) \to \infty$. The extremal boundary
downcrossing time by $X_k$ is $\{\tau_k(t)\}$. Let $\{\xi, \xi_k, k \ge 0\}$ be iid copies of the multiplicative random
walk $\xi = \{\xi(m), m \ge 0\}$. The hitting time of 0 by $\xi_k$ is $\tau_k^*$. A PRM on $[0, \infty) \times (0, \infty]$ with mean
measure $\mathrm{LEB} \times \nu_\alpha$, independent of the $\{\xi, \xi_k, k \ge 0\}$, is $\zeta = \sum_k \epsilon_{(t_k, i_k)}$, and $\{S_k\}$ is the renewal process
given by (3.4) with finite mean interarrival time $q$. For convenience, write $X_{k,m}^{(b_n)} = (X_{k,0}^{(b_n)}, \ldots, X_{k,m}^{(b_n)})$
and $\xi_{k,m} = (\xi_k(0), \ldots, \xi_k(m))$.
Proof of Proposition 3.1. (a) First, recall that, under our assumptions $K \in D(G)$ and $H$ having
a regularly varying tail, the convergence (2.8) takes place for the chain X on the space := 
(0, oo] X [0,oo]'", with limit measure ^ given by (2.7). This implies that [24, Corollary 6.1, p. 183], 

oo oo 
fc=0 fc=0 

in Mp([0,cx)) X E^). Since YX=oHtk,ik,ik m) ''^^ °^ [0'°°) ^ d^^, Proposition 5.3, p. 123]), 

a mapping argument ([24, Proposition 5.2, p. 121]) implies 

oo oo 

Y.Htk,ik^u,n.) = ^Ht„i,A^^t,{i),...,i,U^)) ~ PRM(LEB X /i), 
fc=o fc=o 
in Mp([0,oo) X E^), by (2.7). So we can rewrite (4.1) in Mp([0,oo) x E^), 

oo oo 

Second, we rescale the time axis to place points at the epochs $S_k$. (See [19].) The counting
function for the points $\{S_k\}$ is $N(t) = \sum_k \epsilon_{S_k}[0,t]$ and $N^{\leftarrow}(t) = \inf\{s : N(s) \ge t\} = S_{[t]}$ is the
left continuous inverse process. Define $\Theta_n(\cdot) = n^{-1} N^{\leftarrow}(n\,\cdot)$, so that $S_k/n = \Theta_n(k/n)$ and $\Theta_n$ is a
random element of $D^{\uparrow}[0, \infty)$, the subspace of non-decreasing elements of $D_{\mathrm{left}}[0, \infty)$. By the Strong
Law of Large Numbers, with probability 1,

\[ \Theta_n(t) = \frac{S_{[nt]}}{n} = \frac{S_{[nt]}}{[nt]} \cdot \frac{[nt]}{n} \to qt, \quad t \ge 0, \]

so Qn{-) q{-) in -D^[0,oo). We transform time points using the mapping Ti : D^[0,oo) x 
M+([0,oo) X E^) ^ M+([0,oo) X E^) given by 

(4.3) Ti?n(/) = f{x{u),v)m{du,dv), f G C+([0,oo) x E^). 
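
The time-change step rests on the almost sure convergence $S_{[nt]}/n \to qt$. The sketch below illustrates this numerically; it is not part of the proof. The cycle-length law (geometric on $\{1, 2, \ldots\}$ with mean $q = 3$), the convention $S_0 = 0$, and the function names are assumptions made for the illustration.

```python
import random

def renewal_epochs(rng, n_cycles):
    # Hypothetical integer cycle lengths with P[length = k] = (2/3)^(k-1) * (1/3),
    # so the mean interarrival time is q = 3. The epochs start at S_0 = 0.
    epochs = [0]
    for _ in range(n_cycles):
        length = 1
        while rng.random() < 2.0 / 3.0:
            length += 1
        epochs.append(epochs[-1] + length)
    return epochs

def theta_n(epochs, n, t):
    # Rescaled inverse process: Theta_n(t) = S_[nt] / n.
    return epochs[int(n * t)] / n

rng = random.Random(7)
n = 20000
q = 3.0
epochs = renewal_epochs(rng, 2 * n)
grid = (0.25, 0.5, 1.0, 1.5, 2.0)
max_err = max(abs(theta_n(epochs, n, t) - q * t) for t in grid)
```

For large `n` the rescaled epochs track the line $t \mapsto qt$ closely on the whole grid, which is the behaviour the mapping $T_1$ exploits.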

Applying [24, Proposition 3.1, p. 57] to (4.2), we have iQn{-),'&n) in I^^iO, oo) x 

Mp([0, oo) X E^). Since Ti is a.s. continuous at (g(-),??) (Lemma 4.1, p. 21), the Continuous 
Mapping Theorem gives in Mp([0, oo) x E^), 



(4.4) r?; = E^(5,/n,xitV^>n) = ^ TM-),^) =T.H,t„i,U 



k=o ' ■ """" ' k=0 



Now stack the components of $X_{k,m}^{(b_n)}$ above the time point $S_k/n$. To make functionals continuous,
it is necessary to compactify the state space by letting := [5, oo] x [0, oo]™". Define the restriction 
functional Ts : Mp([0,oo) x E^) Mp([0,oo) x As) by Tgrn = m(- n ([0,oo) x As)). From [9, 
Proposition 3.3], T2 is almost surely continuous at t]' provided P[?/(9([0, 00) x As)) = 0] = 1 and 
since E(r/'((9([0, 00) x As)) = due to fo({(5}) = 0, the a.s. continuity is verified. Therefore, in 
Mp([0, 00) X As), the restricted version of (4.4) is ij" := T2{r]'n) => T2{r]') =: ?/'. Define the stacking 
functional T3 : Mp([0, 00) x As) Mp([0, 00) x [0, 00]) by 



^3(E^(i„,,(0),...,,,M))-EE^ 



(ifc.yfc(i)) 

k j=0 

or for m G Mp([0,oo) x A^), / G C+([0,oo) x [0,oo]), Tsmif) = JJ {ET=o fiu^^j)}^idu,dv). 

Given such / with support [0, R] x [0, 00], the function ip{u, v) := XljLo /(^' '^i) ^ ^k{[^^ ^) ^ ^<5)' 
since it is clearly non- negative continuous, and 93 = outside of [0,i?] x A^. The continuity of 






T3 is clear: given m„ A m in Mp([0, 00) x Ag), we have T^mn{f) = rnni^) ~^ "^(9^) = 23m(/). 
Consequently, in Mp([0,oo) x [0, cxo]), 



(4.5) f,r, = J2J2^ 



k=Oj=0 



Now adjust the sum over j to replace xj!'j^'^ with Xj^j. From (4.5), we readily get, 
(4.6) 77„(- n ([0,00) X (0,00])) ^ ?7(- n ([0,00) X (0,00])) in Mp([0,c3o) X (0,00]) 

by noting that any / G C^([0, 00) x (0, 00]) extends to / G C^([0, 00) x [0, 00]) with /(s, 0) = for 

s > 0. Moreover, recalling {fk+i{t)} and {r^^-^}, the first hitting times of by {X^*^} and {^^j 
respectively, put 

fjfc+i(t) = ffc+i(t) A (m + 1) and a^^i = t^^i A {m + I) , k>0. 
Using this notation, the convergence (4.6) becomes, in Mp([0,(X)) x (0, 00]), 

00 ""fc+lC^n) — 1 

^^ = Y1 Yl ^f£fc 



(4.7) 



IT* —1 

E E ■ 



Equation (4.7) allows spreading the stacks of fjn in time and we verify 

00 O-fe + lC'n) — 1 



(4.8) 



k=0 j=0 



Sk±i 5k^\ 1/£m)>5i ^ ^ ™ Mp([0, 00) X (0, 00]). 



This follows from Slutsky's theorem if dy{f]^, fjn) 0, where is the vague metric on Mp([0, 00) x 
(0,00]); it suffices to show that P[|?7^(/) - '??n(/)| > 7] — for any / G C^{[0,oo) x (0, 00]) and 
7 > 0. For such / with support [0, R] x [M, 00], we have 

P[\&)-Uf)\>i] 

00 0-k + l{bn)~l 



< P 



k=0 j=0 

00 Cfc + lC'n) — 1 

E E 

fc=0 j=l 



00 O-fe+lCfln) — 1 



E E /(^.^^)^^M«-E E /f^.^^V 



fc=0 j=0 



> 7 



/ 



Sk + j -'^/tj 



n 



f 



f Sk_ Xkj 



Since / is uniformly continuous, given p > 0, there exists u > such that \f{x) — f{y)\ < p whenever 
\\x — y\\ < V. For n so large that m/n < v, we have 



0-fe+l(fln)-l 



E E 

k=0 j=l 



f 



Sk + j Xkj 



Sk Xkj 



f 



([0,i?] x [M,oo])l 



n ^ bn J '' \ n ^ h, 
<p-fin{[0,R] X [A/, 00]), 

implying that 

limsup P [|C(/) - Vn{f)\ > 7] < limsupP [7?„([0,i?] x [M,oo]) > jp-' 

<P[7?([0,i?] X [M,oo]) >7P"'] 
by (4.7). So (4.8) follows by letting p^O. 
Finally, we remove the restriction in $m$ on the stacks. Recall the definitions of $\eta_n$ and $\eta$ from
Proposition 3.1. To apply a Slutsky argument (e.g. [24, Theorem 3.5, p. 56]), we show, for $\gamma > 0$,



(4.9) 



lim P Idylf],!]) > 7] = and lim limsupP [^^(77* , ry„) > 7] = 0. 



Let / S Ci([0, 00) X (0, 00]) with support [0, R] x [M, 00]. Taking 6 < a < 00, we write 



00 00 



\v{f)-r]{f)\=Y^ f{qtk,ik^k{j)) ■ {'\-{5<ik<a} + ^{ik>a})- 

k=0 j=m+l 



Hence, 



00 00 



P[m)-v{f)\>l]<P E fiQtk,ik(k{j))l{s<.,<a}>l/2 

'-k=Oj=m+l 



+ P 



00 00 



Y Y f^1^k:ikik{j))'^{i„>a}> 1/'^ 
k=0 j=m+l 



A + B. 



Writing ^l.{m) = supj>^_|_]^ ik{j) for > 0, term A is bounded by 



< P [C([0, R/q] X [5, 00] X {M/a, 00)) > O] , 



E{^fe.H.)([0'^/«]x['^'«)) Y lfcO)>M}}>0 

■ fc=0 j=m+l 

where, since {^fc} are iid and independent of in Mp([0, 00) x (0,oo] x [0,oo]), 



Cm 



'(*fc.«fc.St('Ti)) 



PRM(LEB X X P [ sup e •]). 



fc=o J'^^+i 
Therefore, P[C([0,/?/g] x [5, 00] x (Ma-\ 00)) > 0] = 1 - exp{-A}, where 

A = LE]B[0, R/q] ■ 00] • P [ sup ^{j) > Ma~^] = Rq~^6~'^ P [ sup ^(j) > Ma'^] — > 

j>m+l 

as $m \to \infty$ by (2.15), a consequence of Condition (2.14). For term $B$ we have the bound

\[ P\big[ \zeta([0, R/q] \times [a, \infty]) > 0 \big] = 1 - \exp\{ -E\zeta([0, R/q] \times [a, \infty]) \} = 1 - \exp\{ -R q^{-1} a^{-\alpha} \}. \]

Letting $a \to \infty$ establishes the first limit in (4.9).
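
The bound for term $B$ is a Poisson void probability: a PRM with mean measure $\mathrm{LEB} \times \nu_\alpha$ charges $[0, R/q] \times [a, \infty]$ with probability $1 - \exp\{-Rq^{-1}a^{-\alpha}\}$. A small numerical sketch follows; the parameter values and function names are arbitrary assumptions chosen only to show the decay in $a$.

```python
import math

def nu_alpha_tail(x, alpha):
    # nu_alpha(x, infinity] = x ** (-alpha) for x > 0.
    return x ** (-alpha)

def prob_some_point(R, q, a, alpha):
    # P[zeta([0, R/q] x [a, infinity]) > 0] = 1 - exp(-(R/q) * a ** (-alpha)).
    mean_count = (R / q) * nu_alpha_tail(a, alpha)
    return 1.0 - math.exp(-mean_count)

# Assumed illustrative values: R = 2, q = 3, alpha = 1.5.
levels = (1.0, 10.0, 100.0, 1000.0)
values = [prob_some_point(R=2.0, q=3.0, a=a, alpha=1.5) for a in levels]
```

The probabilities decrease to 0 as the level $a$ grows, which is all the argument uses.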
To prove the second limit in (4.9), observe that 



P[lC(/)-^n(/)|>7] 



00 Tt;^i{bn)~l 

E E / 

k=0 j=m+l 



Sk+j Xkj 



n 



< P 



2_Rn-l 00 

E E / 

k=0 j=m+l 



Sk + J ^kj 



n 



> 



+ P 



k=2Rn j=m+l 



E E / 



> 



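The first term is bounded by

\[ P\Big[ \bigcup_{k=0}^{2Rn-1} \Big\{ \frac{X_{k,0}^{(b_n)}}{b_n} > \delta,\ \sup_{j \ge m+1} \frac{X_{k,j}^{(b_n)}}{b_n} > M \Big\} \Big] \le 2Rn\, P\Big[ \frac{X_0^{(b_n)}}{b_n} > \delta,\ \sup_{j \ge m+1} \frac{X_j^{(b_n)}}{b_n} > M \Big], \]

and

\[ \lim_{m \to \infty} \limsup_{n \to \infty} n P\Big[ \frac{X_0^{(b_n)}}{b_n} > \delta,\ \sup_{j \ge m+1} \frac{X_j^{(b_n)}}{b_n} > M \Big] = 0 \]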
by Condition (2.14). The second term is at most $P[S_{2Rn}/n < R] = P[S_{2Rn}/(2Rn) < 1/2] \to 0$ as
$n \to \infty$, since $S_n/n \to q$ a.s., and $q \ge 1$ by (3.5). This establishes (4.9), completing the proof of
part (a).

(b) This amounts to removing the restrictions in $\delta$, under the additional assumptions (2.10) and
Condition (2.17). We proceed via a Slutsky argument showing that for any $\gamma > 0$,

\[ \lim_{\delta \downarrow 0} P\big[ d_v(\eta, \eta^*) > \gamma \big] = 0 \quad \text{and} \quad \lim_{\delta \downarrow 0} \limsup_{n \to \infty} P\big[ d_v(\eta_n, \eta_n^*) > \gamma \big] = 0. \tag{4.10} \]

Let $f \in C_K^+([0,\infty) \times (0,\infty])$ with support $[0, R] \times [M, \infty]$, and note that

00 ''"fc+i"-'- 

Hence, writing = supj>i Cfc(i), and C' = Efclo «(tfc,ife,ifeC^)' have 

r 00 00 

L k=o j=o 
<P [C\[0, R/q] X {0,5) X [M,(X)]) >0]. 

The {^^j} are iid and independent of C, so C' ~ PRM(/i') on Mp[0, 00) x (0, 00] x [0, 00] with 

IJ,'{ds,dx,dy) = UEM{dx) ■ Va{dx) ■ P [sup.^(j) € x~^dy] 

by [24, Proposition 5.6, p. 144]. Therefore, P [C'{[0,R/q] x (0,5) x [M,oo]) > O] = 1 - exp{-A}, 
where by Lemma 2.1, 



A = Rq-^ / u^idx) P [sup^(i) > Mx-i] < Rq~Hr^ E [supe(jr • l{sup,.>i 



(0,5) 



i>i 



_^(i)>Af5-i} 



Apply (2.18) and dominated convergence as 5 J, to get A ^ and hence the first limit in (4.10). 
For the second limit in (4.10), we have 



P[l^n(/)-<(/)|>7] =P 



"Tk + libn)-! 



< P 



2Rn~lfk + libn)-l /n , ■ V \ 

E E /f^'x')ix(-) > 

^ On ' 
00 Tfc+i{6„)-l / r» 1 • \^ \ 



fc=0 j=0 

+ P 



00 'ffc+l{t'n)-l 

E 

k=2Rn j=0 
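As above, the second term is at most $P[S_{2Rn}/n < R] \to 0$ as $n \to \infty$. The first term is bounded by

\[ P\Big[ \bigcup_{k=0}^{2Rn-1} \Big( \Big\{ \frac{X_{k,0}^{(b_n)}}{b_n} \le \delta \Big\} \cap \bigcup_{j \ge 1} \Big\{ \frac{X_{k,j}^{(b_n)}}{b_n} > M \Big\} \Big) \Big] \le 2Rn\, P\Big[ \frac{X_0^{(b_n)}}{b_n} \le \delta,\ \sup_{j \ge 1} \frac{X_j^{(b_n)}}{b_n} > M \Big] \]
\[ \le 2Rn\, P\Big[ \frac{X_0^{(b_n)}}{b_n} \le \delta,\ \sup_{1 \le j \le m_0} \frac{X_j^{(b_n)}}{b_n} > M \Big] + 2Rn\, P\Big[ \frac{X_0^{(b_n)}}{b_n} \le \delta,\ \sup_{j \ge m_0+1} \frac{X_j^{(b_n)}}{b_n} > M \Big] =: A_n(\delta) + B_n(\delta), \]

with $m_0$ as in Condition (2.17). For $A_n(\delta)$, we have by (2.10),

\[ \lim_{\delta \downarrow 0} \limsup_{n \to \infty} A_n(\delta) = \lim_{\delta \downarrow 0} 2R\, \mu^*\big( [0,\delta] \times ([0,M]^{m_0})^c \big) = 0. \]

For the second, $\lim_{\delta \downarrow 0} \limsup_{n \to \infty} B_n(\delta) = 0$ by Condition (2.17). This establishes (4.10). □

For completeness, the following lemma notes the continuity of the map $T_1$ defined in (4.3). See
also [33, 34].

Lemma 4.1. The mapping $T_1 : D^{\uparrow}[0,\infty) \times M_+([0,\infty) \times E) \to M_+([0,\infty) \times E)$ given by

\[ T_1 m(f) = \iint f(x(u), v)\, m(du, dv), \quad f \in C_K^+([0,\infty) \times E), \]

is continuous at $(x, m)$ whenever the function $x(\cdot)$ is continuous.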

Proof. Suppose $x_n \to x_0$ in $D^{\uparrow}[0,\infty)$ (with respect to the Skorohod topology), where $x_0$ is
continuous, and $m_n \xrightarrow{v} m_0$ in $M_+([0,\infty) \times E)$. Let $f \in C_K^+([0,\infty) \times E)$ with support contained in
$[0,R] \times B$. We show that $T_1 m_n(f) \to T_1 m_0(f)$. For $n \ge 0$, write $f_n(u,v) = f(x_n(u), v)$. The $f_n$
are supported on $x_n^{-1}([0,R]) \times B$, and $x_n^{-1}([0,R]) = [0, x_n^{\leftarrow}(R)]$, where $x_n^{\leftarrow}$ is the right-continuous
inverse of $x_n$. We now argue that the $f_n$, $n \ge 0$, have a common compact support. Indeed, we have
$x_n^{\leftarrow} \to x_0^{\leftarrow}$ pointwise, so $x_n^{\leftarrow}(R) \to x_0^{\leftarrow}(R)$. Thus, for large $n$, $[0, x_n^{\leftarrow}(R)] \times B \subset [0, x_0^{\leftarrow}(R) + 1] \times B$, and
without loss of generality, $m_0(\partial([0, x_0^{\leftarrow}(R) + 1] \times B)) = 0$. Furthermore, $f_n \to f_0$ uniformly:
suppose $(u_n, v_n) \to (u_0, v_0) \in [0,\infty) \times E$. Then $x_n(u_n) \to x_0(u_0)$ since $x_0$ is continuous, and
so $f(x_n(u_n), v_n) \to f(x_0(u_0), v_0)$ by the continuity of $f$. Consequently, $m_n(f_n) \to m_0(f_0)$ by [26,
Lemma 8.2 (b)]. □